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SIM2 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES OF EACH IN 
DL\GNOSIS AND TREATMENT OF OVARIAN, BREAST AND LUNG 

CANCERS 

5 FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to SIM2 polypeptides and polynucleotides and to 
methods of diagnosing and treating ovarian, breast and lung cancers. 

Lung cancer is the primary cause of cancer death among both men and women 
in the United States, The five-year survival rate among all lung cancer subjects, 
10 regardless of the stage of disease at diagnosis, is only 13%. This contrasts with a five- 
year survival rate of 46% among cases in which the disease is still localized. 
However, only 16% of lung cancers are diagnosed at a stage prior to spread of the 
disease. 

Early detection is difficult since clinical symptoms are often not observed until 

15 the disease has reached an advanced stage. Currently, diagnosis is aided by the use of 
chest x-rays, analysis of the type of cells contained in sputum and fiberoptic 
examination of the bronchial passages. Treatment regimens are determined by the 
type and stage of the cancer, and include surgery, radiation therapy and/or 
chemotherapy. In spite of considerable research into therapies for the disease, lung 

20 cancer remains difficult to treat. 

Breast cancer is the most common form of cancer in women, and can also be 
diagnosed in men. Over 200,000 new breast cancer cases are diagnosed each year in 
the United States. In the U.S. today, there are more than two million breast cancer 
survivors, and every woman is at risk. 

25 Molecular biomarkers for breast cancer are of several types. Risk biomarkers 

are those associated with increased cancer risk and include mammographic 
abnormalities, proliferative breast disease with or without atypia, family clustering and 
inherited germ-line abnormalities. Surrogate endpoint biomarkers are tissue, cellular or 
molecular alterations that occur between cancer initiation and progression. These 

30 biomarkers are utilized as endpoints in short-term chemoprevention trials. Prognostic 
biomarkers provide information regarding outcome irrespective of therapy, while 
predictive biomarkers provide information regarding response to therapy. Candidate 
prognostic biomarkers for breast cancer include elevated proliferation indices such as 
Ki-67 and proliferating cell nuclear antigen (PCNA); estrogen receptor (ER) and 
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progesterone receptor (PR) overexpression; markers of oncogene overexpression such 
as c-erbB-2, TGF-a and EGFR; indicators of apoptotic imbalance including 
overexpression of bcl-2 and an increased bax/bcl-2 ratio; markers of disordered cell 
signaling such as p53 nuclear protein accumulation; alteration of differentiation 
signals such as overexpression of c-myc and related proteins; loss of differentiation 
markers such as TGF-b II receptor and retinoic acid receptor; and alteration of 
angiogenesis proteins such as VEGF overexpression [Beenken (2002) Minerva Chir. 
57:437-48]. 

Availability of molecular biomarkers for breast cancer will substantially 
increase diagnostic capabilities and enable the development of prognostic indices, 
which combine the predictive power of individual molecular biomarkers with specific 
clinical and pathologic factors. 

Ovarian cancer, is a significant health problem for women in the United States 
and world wide. Although advances have been made in detection and therapy of this 
cancer, no vaccine or other universally successful method for prevention or treatment 
is currently available. Management of the disease currently relies on a combination of 
early diagnosis and aggressive treatment, which may include one or more of a variety 
of treatments such as surgery, radiotherapy, chemotherapy and hormone therapy. The 
course of treatment for a particular cancer is often selected based on a variety of 
prognostic parameters, including an analysis of specific tumor markers. However, the 
use of established markers often leads to a result that is difficult to interpret, and high 
mortality continues to be observed in many cancer subjects. 

Thus, there remains a need for a practical method of diagnosing lung cancer, 
breast cancer and ovarian cancer as close to inception as possible. In order for early 
detection to be feasible, it is important that specific markers be found and their 
sequences elucidated. 

The Drosophila single minded (sim) gene is the master regulator of fruit fly 
meuro genesis [Thomas (1988); Mambu (1991)]. SIM protein is a transcription factor 
containing a basic helix-loop-helix (bHLH) motif, two PAS (PER/ARNT/SIM) 
domains, and an HST (HIF-o/SIM/TRH) domain [Mambu (1991); Isaac and Andrew 
(1996)]. 

Two mouse homologs of the sim gene (i.e., Siml and Sim2) were cloned. 
Siml maps on mouse chromosome 10 and Sim2 on rriouse chromosome 16 in a region 
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of synteny withHC21 [Fan et al. (1996) Mol. Cell. Neurosci. 7:1-16]. Both mouse 
Siml and Sim2 genes are expressed early [Sim2 from embryonic day 8.0 (E8.0) and 
Siml from day 9.0 (E9.0)] in developing forebrain [Ema et al. (1996) Mol. Cell, Biol. 
16:5865-5875; Fan et al. (1996) supra; MofTett et al. (1996) Genomics 35:144-155; 
5 Yamaki et al. (1996) Genomics 35:136-143] and outside the central nervous system 
(CNS), in somites, mesonephric duct, and foregut (SIMl), in facial and trunk cartilage, 
trunk muscles (Sim2), and in the developing kidney (SIMl and SIM2) [Dahmane et al. 
(1995) Proc. Natl. Acad. Sci, 92:9191-9195; Ema et al. (1996) supra; Fan et al. (1996) 
supra; Moffett et al. (1996) supra]. 

10 In adult mouse, both SIMl and SIM2 are expressed in kidney and skeletal 

muscles, whereas Sim2 is also expressed in the lung (Ema et al. (1996) Biochem. 
Biophys. Res. Conmiun. 218:588-594; Moffett et al. (1996) supra]. 

Recently, the cloning of the cDNAs for two human homologs (SIMl and 
SIM2) of the Drosophila sim gene, and mapping of SIMl to chromosome 6ql6.3-q21 

15 have been reported [Chrast (1997) Genome Research 7:615-624 and Chen et al. 
(1995)]. Northern blot analyses indicated the transcription of several mRNA 
transcripts from the SIM2 gene, including those of 2.7, 3, 4.4 and 6kb. The multiple 
mRNAs may be products of alternative splicing, overlapping transcription, or different 
utilization of 5' or 3' untranslated sequences. At least two different forms of the 

20 human SIM2 gene have been characterized. The long form (GenBank Accession No. 
U80456; SEQ ID NO: 7 is 3921 bp and codes for a protein of 667 amino acid with an 
apparent molecular weight of 74 kD. The short-form (GenBank Accession No. 
U80457; SEQ ID NO: 8) is 2859 bp and codes for a protein of 570 amino acid with an 
apparent molecular weight of 64 kD. 

25 Human SIM-1 and SIM-2, in combination with ARNT, attenuate transcription 

from the hypoxia-inducible erythropoietin (EPO) enhancer during hypoxia. SIM 
protein levels decrease with hypoxia treatment, suggesting a negative feedback 
mechanism. Upregulation and activation of HEF-la is concomitant with attenuation of 
SIM activities [Woods SL., Whitelaw ML., J. Biol. Chem 2002; 277:10236-43]. 

30 PCT Application No. WO 02/12565 discloses the use of SIM2 as a marker and 

possible therapeutic target for specific types of cancer. Increased expression of SIM2 
in specific cancers including colon, prostate and pancreas tumors as compared to 
normal tissues was observed. In accordance, a similar pattern of expression was found 
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by DeYoung and co-worlers [Proc. Natl. Acad. Sci. (2003) 100:4760-5 and Anticancer 
Res. (2002) 22(6A):3 149-57]. Interestingly, neither of the above-publications showed 
an elevated expression of SBM2 in ovarian, lung and breast tumors. 

While reducing the present invention to practice the present inventors 
5 uncovered that elevated levels of SIM2 are associated with ovarian, breast and lung 
tumors, thus showing for the first time that SIM2 can also be used as a marker and 
possible therapeutic target for ovarian, breast and lung tumors. 



SUMMARY OF THE INVENTION 

10 According to one aspect of the present invention there is provided a method of 

diagnosing predisposition to, or presence of ovarian cancer, breast cancer and/or lung 
cancer in a subject, the method comprising determining a level of SIM2 in a biological 
sample obtained fi-om the subject, the level being correlatable with predisposition to, 
or presence or absence of the ovarian cancer, breast cancer and/or lung cancer, thereby 

15 diagnosing predisposition to, or presence of ovarian cancer, breast cancer and/or lung 
cancer in the subject. 

According to further features in preferred embodiments of the invention 
described below, the biological sample is a tissue sample and/or a body fluid sample. 

According to still further features in the described preferred embodiments the 

20 tissue sample is selected fi*om the group consisting of an ovarian tissue, a lung tissue 
and a breast tissue. 

According to another aspect of the present invention there is provided a method 
of treating ovarian cancer, breast cancer and/or lung cancer in a subject, the method 
comprising downregulating expression or activity of SIM2 in a lung tissue, breast 
25 tissue and/or an ovarian tissue, thereby treating the ovarian cancer, breast cancer 
and/or lung cancer in the subject. 

According to still further features in the described preferred embodiments the 
SIM? is selected from the group consisting of SEQ ID NOs: 1, 2, 3, 7, 8 and 9. 

According to still further features in the described preferred embodiments 
30 downregulating expression or activity of the SIM2 is effected by administering to the 
subject: 

(a) a molecule which binds SIM2; 

(b) an enzyme which cleaves SIM2; 



(c) an antisense polynucleotide capable of specifically hybridizing with 
an mRNA transcript encoding SIM2; 

(d) a ribozyme which specifically cleaves SIM2 transcripts; 

(e) a non- functional analogue of at least a catalytic or binding portion of 
SIM2; 

(f) a molecule which prevents SEVI2 activation or substrate binding; 

(g) an siRNA molecule capable of inducing degradation of S1M2 
transcripts; 

(h) a DNAzyme which specifically cleaves SIM2 transcripts or DNA; and 

(i) a molecule which promotes a SIM2-specific immunogenic response. 
According to still further features in the described preferred embodiments the 

molecule which binds SIM2 is an antibody or antibody firagment capable of 
specifically binding the SIM2. 

According to yet another aspect of the present invention there is provided use 
of an agent capable of downregulating SIM2 expression or activity for the treatment of 
ovarian, breast and/or lung cancer. 

According to still further features in the described preferred embodiments the 
agent capable of downregulating SDVI2 activity is an antibody or antibody fragment. 

According to still further features in the described preferred embodiments the 
agent capable of downregulating SIM2 expression or activity is an oligonucleotide. 

According to still further features in the described preferred embodiments the 
oligonucleotide is a single or double stranded polynucleotide. 

According to still further features in the described preferred embodiments the 
oligonucleotide is at least 17 bases long. 

According to still further features in the described preferred embodiments the 
oligonucleotide is hybridizable in either sense or antisense orientation. 

According to still another aspect of the present invention there is provided use 
of a SIM2 detecting agent for detecting ovarian, breast and/or lung cancer. 

According to still further features in the described preferred embodiments the 
agent for detecting ovarian, breast and/or lung cancer is an oligonucleotide . 

According to still further features in the described preferred embodiments the 
agent for detecting ovariaii, breast and/or lung cancer is an antibody or antibody 
fragment. 



According to still further features in the described preferred embodiments the 
agent for detecting ovarian, breast and/or lung cancer is coupled to a detectable 
moiety selected from the group consisting of a chromogenic moiety, a fluorogenic 
moiety, a radioactive moiety and a light-emitting moiety. 

5 According to an additional aspect of the present invention there is provided an 

article-of-manufacture comprising a packaging material and a composition identified 
for treating ovarian, breast and/or lung cancer being contained within the packaging 
material, the composition including, as an active ingredient, an agent capable of 
dowtiregulating SIM2 expression or activity. 

10 According to still an additional aspect of the present invention there is 

provided an isolated polynucleotide comprising a nucleic acid sequence encoding a 
polypeptide being at least 80 % homologous to SEQ ID NO: 39, 40 or 41 as 
determined using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where the gap creation equals 8 and gap 

15 extension penalty equals 2. 

According to still further features in the described preferred embodiments the 
polypeptide is as set forth in SEQ ED NO: 39, 40 or 41. 

According to a further aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence being 80 % identical to 

20 SEQ ID NO: 39, 40 or 41, as determined using the BestFit software of the Wisconsin 
sequence analysis package, utilizing the Smith and Waterman algorithm, where gap 
weight equals 50, length weight equals 3, average match equals 10 and average 
mismatch equals -9. 

. According to still further features in the described preferred embodiments the 
25 iiucleic acid sequence is as set forth in SEQ ED NO: 2 or 3. 

According to yet a further aspect of the present invention there is provided an 
isolated polynucleotide as set forth in SEQ ED NO: 2 or 3. 

According to still a further aspect of the present invention there is provided a 
nucleic acid construct comprising the isolated polynucleotide. 
30 According to still a further aspect of the present invention there is provided an 

isolated polypeptide as set forth in SEQ ID NO: 39, 40 or 41. 

According to still a further aspect of the present invention there is provided a 
method of diagnosing predisposition to, or presence of cancer in a" subject, the method 
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comprising determining a level of SEQ ID NO: 2 and/or 3 in a biological sample 
obtained from the subject, wherein the biological sample is suspected of being a 
cancerous tissue or associated with the cancerous tissue and whereas the level being 
correlatable with predisposition to, or presence or absence of the cancer, thereby 
diagnosing predisposition to, or presence of cancer in the subject. 

According to still further features in the described preferred embodiments the 
determining level of the SEQ ID NO: 2 and/or 3 is effected at an mRNA level. 

According to still further features in the described preferred embodiments the 
determining level of the SEQ ID NO: 2 and/or 3 is effected at a protein level. 
5 According to still further features in the described preferred embodiments the 

determining level of the SEQ ID NO: 2 and/or 3 is effected at a gene amplification 
level. 

According to still a further aspect of the present invention there is provided a 
method of treating cancer in a subject, the method comprising downregulating 

10 expression or activity of SEQ ED NO: 2 and/or 3 in a cancerous tissue, thereby treating 
the cancer in the subject. 

The present invention successfully addresses the shortcomings of the presently 
known configurations by providing SIM2 polypeptides and polynucleotides encoding 
same which can be used to diagnose and treat ovarian and lung cancers. 

15 Unless otherwise defined, all technical and scientific terms used herein have 

the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. In case of conflict, the patent 

20 specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to 
25 the accompanying drawings. With specific reference now to the drawings in detail, it 
is stressed that the particulars shown are by way of example and for purposes of 
illustrative discussion of the preferred embodiments of the present invention only, and 
are presented in the cause of providing what is believed to be the most, useful and 
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readily understood description of the principles and conceptual aspects of the 
invention. In this regard, no attempt is made to show structural details of the invention 
in more detail than is necessary for a fundamental understanding of the invention, the 
description taken with the drawings making apparent to those skilled in the art how the 
5 several forms of the invention may be embodied in practice, 
hi the drawings: 

FIG. 1 is a schematic illustration showing the genomic organization of SIM2 
expression products. Boxes designate exons and lines designate introns. Arrow heads 
designate primers. 

10 FIG. 2 is a histogram showing SIM2 expression in normal and tumor-derived 

lung samples as determined by real time PGR using SEQ ID NO: 20, normalized to the 
housekeeping gene RPS27A (SEQ ID NO: 23). Two independent experiments are 
shown. 

FIG. 3 is a histogram showing SIM2 expression in normal and tumor derived 
15 colon samples as determined by real time PGR using a SIM2 derived fragment (SEQ 
ED NO: 20). 

FIG. 4 is a histogram ishowing SIM2 long and short transcript expression in 
normal and tumor-derived lung samples as determined by real time PGR using a SIM2 
derived fragment (SEQ ID NO: 9) corresponding to coordinates 232- 404 of SEQ ID 
20 Nos 7 and 8. Expression of SIM-2 derived fragment (SEQ ID NO: 9) was normalized 
to the expression of RPS27A (SEQ ID NO: 23) housekeeping gene. Two independent 
experiments are shown. 

FIG. 5 is a histograms depicting SIM2 expression as in Figure 4 on a 0-200 

scale. 

25 FIG. 6 is a histogram showing SIM2 long and short transcript (SEQ ID NOs: 7 

and 8) expression in normal and tumor derived lung samples as determined by real 
time PGR using a SIM2 derived fragment (SEQ ID NO: 9). Expression of SEQ ID 
NO: 9 was normalized to the expression of RPS27A (SEQ ID NO: 23) housekeeping 
gene. 

30 FIG. 7 is a histogram depicting SIM2 expression as in Figure 6 on a 0-200 

scale. 

FIG. 8 is a histogram showing SIM2 expression in nonnal and tumor derived 
ovarian samples as determined by real time PCR using a SIM2 derived fragments 
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(SEQ ID NOs: 9 and 20). Expression was normalized to the averaged expression of 
three housekeeping genes PBGD (SEQ ID NO: 32), ATP-6-syn (SEQ ID NO: 26) and 
1 8s ribosomal RNA (SEQ ID NO: 29). 

FIG. 9 is a histogram showing SIM2 long variant expression in normal and 
tumor derived ovary samples as determined by real time PGR using a SIM2 derived 
fragment (SEQ ID NO: 18). Expression was normalized to the averaged expression of 
two housekeeping genes PBGD (SEQ ID NO: 32) and HPRTl (SEQ ID NO: 35). 

FIG. 10 is a histogram showing SIM2 long variant expression in normal and 
tumor derived lung samples as determined by real time PGR using a SIM2 derived 
fragment (SEQ ID NO: 18). Expression was normalized to the averaged expression of 
three housekeeping genes SDHA (SEQ ID NO: 38), RPS27A (SEQ ED NO: 23) and 
PBGD (SEQ ID NO: 32). 

FIG. 1 1 is a histogram showing SIM2 short variant expression in normal and 
tumor derived ovarian samples as determined by real time PGR using a SIM2 derived 
fragment (SEQ ID NO: 19). Expression was normalized to the averaged expression of 
two housekeeping genes PBGD (SEQ ID NO: 32) and HPRTl (SEQ ID NO: 35) . 

FIG. 12 is a photomicrograph showing the expression of SIM2 long variant in 
normal and tumor derived breast samples as determined by RT-PGR using a SIM2 
derived fragment (SEQ ID NO: 18). 

FIG. 13 is a photomicrograph showing the expression of SIM2 short variant in 
normal and tumor derived breast samples as determined by RT-PGR using a SIM2 
derived fragment (SEQ ID NO: 19). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of SIM2 polypeptides and polynucleotides encoding 
same, which can be used to diagnose aiid treat ovarian and lung cancers. 

The principles and operation of the present invention may be better understood 
with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it is to be 
understood that the invention is not limited in its application to the details set forth in 
the following description or exemplified by the Examples. The invention is capable of 
other embodiments or of being practiced or carried out in various ways. Also, it is to 
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be understood that the phraseology and terminology employed herein is for the 
purpose of description and should not be regarded as limiting. 

Limg cancer and ovarian cancer are deadly diseases for which treatment is 
currently limited. It is well established that early detection of primary, metastatic and 
recurrent diseases can significantly impact the prognosis of individuals suffering fi*om 
lung cancer and ovarian cancer. 

Current methods for early detection of ovarian cancer involve transabdominal / 
transvaginal ultrsonography and a blood test for the tumor marker CA-125. While 
ultrasound screening is often misleading since most enlarged ovaries result from 
benign cysts, the use of the tumor marker CA-125 in diagnosing ovarian cancer is 
limited due to a high false-positive rate. 

Early detection of lung cancer is currently effected using chest X-rays and 
sputum cytology. Since these methods do not have any effect on mortality, detection 
still may be too late to affect the natural cause of the disease. 

The Drosophila single minded (SIM) gene is the master regulator of fiiiit fly 
meurogenesis [Thomas (1988); Mambu (1991)]. SIM protein is a transcription factor 
containing a basic helix-loop-helix (bHLH) motif, two PAS (PER/ARNT/SIM) 
domains, and an HST (HIF-a/SIM/TRH) domain [Mambu (1991); Isaac and Andrew 
(1996)]. The native human SIM2 gene has been cloned and multiple mRNA products 
of the gene have been found by Northern blot analyses. 

SI1VI2 has been previously associated with colon, pancreatic and prostate 
cancers but not with lung and ovarian cancers (PCT Application No. WO02/12565). 

While reducing the present invention to practice the present inventors 
uncovered, for the first time, that elevated levels of SIM2 are present in ovarian, breast 
and lung tumors, thus providing evidence that this gene can be utilized as a diagnostic 
marker for ovarian, breast and lung tumors, or can serve as a basis for a therapeutic 
agent for treating such tumors. 

Thus, according to one aspect of the present invention there is provided a 
method of diagnosing predisposition to, or presence of ovarian cancer, breast cancer 
and/or lung cancer in a subject. 

As used herein the term "predisposition" refers to a latent susceptibility to 
ovarian, breast and/or lung cancer, which may lead, under certain conditions, to the 
formation of ovarian, breast and/or lung cancer. 



As used herein the phrase "ovarian cancer" refers to epithelial tumors (i.e., 
carcinomas) and non-epithelial tumors (e.g., stroma cell and germ cell tumors of the 
ovary). 

As used herein the phrase "breast cancer" refers to non-invasive and invasive 
5 tumors of the breast including Lobular Carcinoma In Situ (LCIS), Ductal carcinoma. 
Ductal Carcinoma In Situ (DCIS) and Carcinoma In Situ. 

As used herein the phrase "lung cancer" refers to cancers of the lung including 
small cell lung cancer and non-small cell limg cancer. 

The method, according to this aspect of the present invention is effected by 
10 determining a level of SIM2 in a biological sample which is obtained from the subject 
thereby diagnosing predisposition to, or presence of ovarian cancer, breast cancer 
and/or lung cancer in the subject. 

As used herein "a biological sample" " refers to a sample of tissue (e.g., breast, 
lung, ovary) or fluid isolated from a subject, including but not limited to, for example, 
15 plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respiratory, 
intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and 
also samples of in vivo cell culture constituents. 

As used herein, the term "level" refers to expression of SIM2 RNA and/or 
protein or SIM2 DNA copy nximber. 
20 As is ftirther described hereinbelow and in the examples section which follows, 

the present inventors have shown that levels of SIM2 beyond those found in normal 
tissue correlate with predisposition to, or presence or absence of the ovarian cancer, 
breast cancer and/or lung cancer providing evidence that this gene can serve as a 
marker for breast cancer, lung cancer and ovarian cancer. 
25 As used herein SIM2 refers to a SIM2 gene as set forth in sequence coordinates 

34649835 - 34700062 of chromosome 21q22.13, expression products of the SIM2 
gene as well as, fragments and variants thereof. 

As used herein the term "variants" refers to splice variants and allelic variants 
ofSIM2. 

30 The phrase "splice variant" refers to alternative forms of RNA transcribed from 

a SIM2 gene. Splice variation arises naturally through use of altemative splicing sites 
wittiin a transcribed RNA molecule, or less commonly between separately transcribed 
RNA molecules, and may result in several mRNAs transcribed from the same gene. 
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Splice variants may encode polypeptides having altered amino acid sequence. The 
temi splice variant is also used herein to denote a polypeptide encoded by a splice 
variant of an mRNA transcribed from a gene. 

The phrase "allelic variant" refers to two or more altemative forms of a SIM2 
gene occupying the same chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic polymorphism within populations. 
Gene mutations can be silent (no change in the encoded polypeptide) or may encode 
polypeptides having altered amino acid sequence. The term allelic variant is also used 
herein to denote a protein encoded by an allelic variant of a gene. 

Examples of SIM2 expression products (i.e., splice variants) are set forth in 
SEQ ID NOs: 7 (GenBank Accession No. U80456) and 8 (GenBank Accession No. 
U80457), which correspond to the long form and short form of SEM2, respectively. 

It will be appreciated that while reducing the present invention to practice 
additional SIM2 splice variant have been discovered by the present inventors. These 
are set forth in SEQ ID NOs: 2 and 3, further description of which is provided 
hereinbelow and in Figure 1 and Table 1, below. 

Numerous well known tissue or fluid collection (e.g., sputum collection) 
methods can be utilized to collect the biological sample from the subject in order to 
determine the level of SIM2 DNA, RNA and/or polypeptide of the subject. 

Tissue biopsy can be utilized to collect a tissue sample from lung tissue, breast 
tissue and/or ovarian tissue. Methods of performing lung, breast and ovarian biopsies 
are well known in the art. 

Typically, an ovarian biopsy is effected using fine-needle aspiration (FNA), 
which is an important diagnostic tool in gynecology. Its main role is in diagnosis of 
advanced and recurrent gynecologic malignancies. The technique uses a small-gauge 
needle to aspirate a lesion for cytologic analysis, sometimes with the aid of 
radiographic imaging. 

A number of approaches for performing breast biopsies are known in the art. 
A Stereotactic Needle Biopsy is a relatively novel approach, which is 
employed when the physician cannot feel the lump that was found on a mammogram. 
A Stereotactic Biopsy uses mammographic images and computer technology and 
combines them to detemiine the exact location of an abnormality to obtain a sample of 
breast tissue. In this Way, non-surgical techniques are used to determine whether an 



13 

abnormality is cancerous. This procedure is equally accurate as surgical biopsy 
without the scar and anesthetic risks of surgery. The patient lies facedown on a special 
table with an opening for the breast. Using special equipment, the breast is compressed 
similar to a mammogram but with less compression, and then x-rayed from several 
angles. The data is entered into a computer, and then a radiologist inserts a biopsy 
needle into the lump. 

Fine Needle Aspiration involves inserting a very fine, hollow needle into a cyst 
to remove some fluid or tissue. 

Core Needle Biopsy utilizes a larger needle inserted into the breast mass to 
remove a tissue for examination. 

Ductal Lavage is a relatively new, minimally invasive FDA approved 
procedure being used for women who are considered at high risk of developing breast 
cancer. The doctor inserts a catheter into a milk duct and withdraws a sampling of 
cells. 

Lung biopsies can be performed using a variety of techniques. A bronchoscopy 
is preferably effected to retrieve lung tissues which are located deep in the chest. If 
the area lies close to the chest wall, a needle biopsy is often done. If both these 
methods fail, an open surgical biopsy may be carried out. If there are indications that 
the Ixmg cancer has spread to the lymph nodes in the mediastinum, a mediastinoscopy 
is performed. 

When a needle biopsy is to be done, the subject will be given a sedative about 
an hour before the procedure, to help relaxation. The subject sits in a chair with arms 
folded on a table in front. X rays are then taken to identify the location of the 
suspicious areas. Small metal markers are placed on the overlying skin to mark the 
biopsy site. The skin is thoroughly cleansed with an antiseptic solution, and a local 
anesthetic is injected to numb the area. 

A small cut (incision) about half an inch in length is then being made. The 
subject is asked to take a deep breath and hold it while a special biopsy needle is 
inserted through the incision into the lung. When enough tissue has been obtained, the 
needle is withdrawn. Pressure is applied at the biopsy site and a sterile bandage is 
placed over the cut. The entire procedure takes between 30 and 45 minutes. 

The subject may feel a brief sharp pain or some pressure as the biopsy needle is 
inserted. Most subjects, however, do not experience severe pain. 
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Open biopsies are performed in a hospital under general anesthesia. As with 
needle biopsies, subjects are given sedatives before the procedure. An intravenous 
line is placed in the arm to give medications or fluids as necessary. A hollow 
endotracheal tube, is passed through the throat, into the airway leading to the lungs. It 
5 is used to convey the general anesthetic. 

Once the subject is under anesthesia, an incision over the lung area is made. 
Some lung tissue is removed and the cut closed with stitches. The entire procedure 
takes about an hour. A chest tube is sometimes placed with one end inside the lung and 
the other end protruding through the closed incision. Chest tube placement is done to 
10 prevent the lungs from collapsing by removing the air from the lungs. The tube is 
removed a few days following the biopsy. A chest X ray is done following an open 
biopsy, to check for lung collapse. 

The preparation for a mediastinoscopy is similar to that for an open biopsy. 
The subject is sedated and prepared for general anesthesia. The neck and the chest 
15 will be cleansed with an antiseptic solution. Once the subject is under anesthesia, an 
incision of about two or three inches long is made at the base of the neck. A thin, 
hollow, lighted mediastinoscope is inserted through the cut into the space between the 
right and the left lungs. The space is examined thoroughly and any lymph nodes or 
tissues that look abnormal are removed. The mediastinoscope is then removed, and the 
20 incision stitched up and bandaged. 

Regardless of the procedure employed, once a biopsy is obtained the level of 
SIM2 can be determined and a diagnosis can thus be made. 

Determining a level of SIM2 can be effected using various biochemical and 
molecular approaches used in the art for determining gene amplification, and/or level 
25 of gene expression. 

It will be appreciated that since SIM2 is expressed in normal lung, breast and 
ovarian tissues at low levels, detection of SIM2 in a normal tissue is preferably 
effected along side to detect an elevated expression and/or amplification. Samples 
used to determine the normal range of SIM2 can be normal samples from individuals 
30 not suffering from the disease condition. 

Typically, detection of a nucleic acid of interest in a biological sample is 
effected by hybridization-based assays using an oligonucleotide probe. 
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The term "oligonucleotide" refers to a single stranded or double stranded 
oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or 
mimetics thereof This term includes oligonucleotides composed of naturally- 
occurring bases, sugars and covalent intemucleoside linkages (e.g., backbone) as well 
as oligonucleotides having non-naturally-occurring portions which function similarly 
to respective naturally-occurring portions. An example of an oligonucleotide probe 
which can be utilized by the present invention is a single stranded polynucleotide 
which includes a sequence complementary to the sequence region encompassed by 
SEQIDNO:9or 1. 

Oligonucleotides designed according to the teachings of the present invention 
can be generated according to any oligonucleotide synthesis method known in the art 
such as enzymatic synthesis or solid phase synthesis. Equipment and reagents for 
executing solid-phase synthesis are commercially available from, for example. 
Applied Biosystems, Any other means for such synthesis may also be employed; the 
actual synthesis of the oligonucleotides is well within the capabilities of one skilled in 
the art and can be accomplished via established methodologies as detailed in, for 
example, "Molecular Cloning: A laboratory Manual" Sambrook et al., (1989); 
"Current Protocols in Molecular Biology" Volumes I-III Ausubel, R. M., ed. (1994); 
Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and Sons, 
Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John 
Wiley & Sons, New York (1988) and "Oligonucleotide Synthesis" Gait, M. J., ed. 
(1984) utilizing solid phase chemistry, e.g. cyanoethyl phosphoramidite followed by 
deprotection, desalting and purification by for example, an automated trityl-on method 
orHPLC. 

The oligonucleotide of the present invention is of at least 17, at least 18, at 
least 19, at least 20, at least 22, at least 25, at least 30 or at least 40, bases specifically 
hybridizable with SIM2 derived sequence (e.g., SEQ ID NO: 1 which are hybridizable 
with the primers set forth in SEQ ED NOs. 4 and 5). 

The oligonucleotides of the present invention may comprise heterocylic 
nucleosides consisting of purines and the pyrimidines bases, bonded in a 3' to 5' 
phosphodiester linkage. 

Preferably used oligonucleotides are those modified in either backbone, 
intemucleoside linkages or bases, as is broadly described hereinunder. 
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Specific examples of preferred oligonucleotides useful according to this aspect 
of the present invention include oligonucleotides containing modified backbones or 
non-natural intemucleoside linkages. Oligonucleotides having modified backbones 
include those that retain a phosphorus atom in the backbone, as disclosed in U.S. Pat. 

5 NOs: 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 
5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466, 
677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 
5,587,361; and 5,625,050, 

Preferred modified oligonucleotide backbones include, for example, 

10 phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkyl phosphotriesters, methyl and other alkyl phosphonates including 3'- 
alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3 -amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 

15 boranophosphates having normal 3-5* linkages, 2*-5' linked analogs of these, and those 
having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3'-5* 
to 5'-3' or 2'-5* to 5'-2'. Various salts, mixed salts and fi*ee acid forms can also be used. 

Alternatively, modified oligonucleotide backbones that do not include a 
phosphorus atom therein have backbones that are formed by short chain alkyl or 

20 cycloalkyl intemucleoside linkages, mixed heteroatom and alkyl or cycloalkyl 
intemucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
intemucleoside linkages. These include those having morpholino linkages (formed in 
part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide 
and sulfone backbones; formacetyl iand thioformacetyl backbones; methylene 

25 formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate 
backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 
sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 
component parts, as disclosed in U.S. Pat, Nos. 5,034,506; 5,166,315; 5,185,444; 
5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 

30 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 
5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623, 070; 5,663,312; 
5,633,360; 5,677,437; and 5,677,439, . 



Other oligonucleotides which can be used according to the present invention, 
are those modified in both sugar and the intemucleoside linkage, i.e., the backbone, of 
the nucleotide units are replaced with novel groups. The base units are maintained for 
complementation with the appropriate polynucleotide target. An example for such an 

5 oligonucleotide mimetic, includes peptide nucleic acid (PNA). A PNA 
oligonucleotide refers to an oligonucleotide where the sugar-backbone is replaced with 
an amide containing backbone, in particular an aminoethylglycine backbone. The 
bases are retained and are bound directly or indirectly to aza nitrogen atoms of the 
amide portion of the backbone. United States patents that teach the preparation of 

10 PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; 
and 5,719,262, each of which is herein incorporated by reference. Other backbone 
modifications, which can be used in the present invention are disclosed in U.S. Pat. 
No: 6,303,374. 

Oligonucleotides of the present invention may also include base modifications 

15 or substitutions. As used herein, "urmiodified" or "natural" bases include the purine 
bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) 
and uracil (U). Modified bases include but are not limited to other synthetic and 
natural bases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, 
hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and 

20 guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2- 
thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and 
cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8- 
halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and 
guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted 

25 uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8- 
azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3- 
deazaadenine. Further bases include those disclosed in U.S. Pat. No: 3,687,808, those 
disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 
858-859, Kroschwitz, J. I., ed. John Wiley .& Sons, 1990, those disclosed by EngUsch 

30 et aL, Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed 
by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, 
Crooke, S. T. and Lebleu, B. , ed., CRC Press, 1993. Such bases are particularly 
useful for iricfeasing the binding affinity of the oligomeric compounds of the 
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invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 
and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5- 
propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic 
acid duplex stability by O.e-Ll'^C. [Sanghvi YS et aL (1993) Antisense Research and 
5 Applications, CRC Press, Boca Raton 276-278] and are presently preferred base 
substitutions, even more particularly when combined with 2-O-methoxyethyl sugar 
modifications. 

It will be appreciated that olignoculeotides of the present invention may 
include further modifications which increase bioavailability, therapeutic efficacy and 
10 reduce cytotoxicity. Such modifications are described in Younes (2002) Current 
Pharmaceutical Design 8: 1451-1466. 

Hybridization based assays which allow the detection of SIM2 (i.e., DNA or 
RNA) in a biological sample rely on the use of oligonucleotide which can be 10, 15, 
20, or 30 to 100 nucleotides long preferably from 10 to 50, more preferably from 40 to 
15 50 nucleotides. 

Hybridization of short nucleic acids (below 200 bp in length, e.g. 17-40 bp in 
length) can be effected using the following examplery hybridization protocols which 
can be modified according to the desired stringency; (i) hybridization solution of 6 x 
SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA 
20 (pH 7.6), 0.5 % SDS, 100 ^g/ml denatured salmon sperm DNA and 0.1 % nonfat dried 
milk, hybridization temperature of 1 - 1.5 ""C below the Tm, final wash solution of 3 M 
TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 
1.5 °C below the Tm; (H) hybridization solution of 6 x SSC and 0.1 % SDS or 3 M 
TMACI, 0.01 M sodium phosphate (pH 6,8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 
25 |ig/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization 
temperature of 2 - 2.5 "^C below the Tm, final wash solution of 3 M TMACI, 0.01 M 
sodium phosphate (pH 6.8),. 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 1.5 °C below the 
Tm, final wash solution of 6 x SSC, and final wash at 22 ^'C; (Hi) hybridization 
solution of 6 x SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 
30 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 (ig/ml denatured salmon sperm DNA and 0.1 
% nonfat dried milk, hybridization temperature. 



The detection of hybrid duplexes can be carried out by a number of methods. 
Typically, hybridization duplexes are separated from unhybridized nucleic acids and 
the labels bound to the duplexes are then detected. Such labels refer to radioactive, 
fluorescent, biological or enzymatic tags or labels of standard use in the art. A label 

5 can be conjugated to either the oligonucleotide probes or the nucleic acids derived 
from the biological sample (target). 

For example, oligonucleotides of the present invention can be labeled 
subsequent to synthesis, by incorporating biotinylated dNTPs or rNTP, or some similar 
means (e.g., photo-cross-linking a psoralen derivative of biotin to RNAs), followed by 

10 addition of labeled streptavidin (e.g., phycoerythrin-conjugated streptavidin) or the 
equivalent. Alternatively, when fluorescently-labeled oligonucleotide probes are 
used, fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, 
Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others [e.g., Kricka et al. 
(1992), Academic Press San Diego, Calif] can be attached to the oligonucleotides. 

15 Traditional hybridization assays include PGR, RT-PCR, RNase protection, in- 

situ hybridization, primer extension, Southem blot, Northem Blot and dot blot 
analysis. 

Those skilled in the art will appreciate that wash steps may be employed to 
wash away excess target DNA or probe as well as unbound conjugate. Further, 
20 standard heterogeneous assay formats are suitable for detecting the hybrids using the 
labels present on the oligonucleotide primers and probes. 

It will be appreciated that a variety of controls may be usefiiUy employed to 
improve accuracy of hybridization assays. For instance, samples may be hybridized to 
an irrelevant probe and treated with RNAse A prior to hybridization, to assess false 
25 hybridization. 

Specifically, gene amplification may be measured directly by DNA analysis 
such as Southem blot or dot blot techniques. For Southem blotting, DNA is extracted 
using methods which are well known in the art, involving tissue mincing, cell lysis, 
protein extraction and DNA precipitation using 2 to 3 volumes of 100% ethanol, 
30 rinsing in 70% ethanol, pelleting, drying and resuspension in water or any other 
suitable buffer (e.g., Tris-EDTA). Preferably, following such procedure, DNA 
concentration is determined such as by measuring the optical density (OD) of the 
sample at 260 nm (wherein 1 unit OD=50 jig/ml DNA). 



To determine the presence of proteins in the DNA solution, the OD 260/OD 
280 ratio is determined. Preferably, only DNA preparations having an OD 260/OD 
280 ratio between 1.8 and 2 are used in the following procedures described 
hereinbelow. 

5 The purified DNA is then digested with one or more restriction enzymes, and 

the resulting fragments separated on an agarose gel by electrophoresis. The DNA 
fragments arc then transferred to a nylon or cellulose nitrate filter by blotting, and the 
DNA fixed by baking. The filter is then exposed to a labeled complementary probe 
and the regions of hybridization detected, usually by autoradiography. Dot blotting is 

10 similar, except that the DNA fragments are not separated on the gel. The degree of 
gene amplification is then determined by dilutional analysis or densitometry scanning. 

Polymerase chain reaction (PCR)-based methods may be used to identify the 
presence of SIM2 mRNA. For PCR-based methods a pair of oligonucleotides is used, 
which is specifically hybridizable with the SIM2 polynucleotide sequences described 

15 hereinabove in an opposite orientation so as to direct exponential amplification of a 
portion thereof (including the hereinabove described sequence alteration) in a nucleic 
acid amplification reaction. For example, an oligonucleotide pair of primers 
specifically hybridizable with SIM2 are set forth in SEQ ID NOs: 10 and 11 which 
provide an amplification product which corresponds to SEQ ID NO: 9. 

20 The polymerase chain reaction and other nucleic acid amplification reactions 

are well known in the art and require no fiirther description herein. The pair of 
oligonucleotides according to this aspect of the present invention are preferably 
selected to have compatible melting temperatures (Tm), e.g., melting temperatures 
which differ by less than that 7 ''C, preferably less than 5 °C, more preferably less than 

25 4 ^'C, most preferably less than 3 ^C, ideally between 3 ""C and 0 X. 

Hybridization to oligonucleotide arrays may be also used to determine SIM2 
expression. Such screening has been undertaken in the BRCAl gene and in the 
protease gene of HIV-1 virus [see Hacia et al., (1996) Nat Genet 1996;14(4):441-447; 
Shoemaker et al., (1996) Nat Genet 1996;14(4):450-456; Kozal et aL, (1996) Nat Med 

30 1 996;2(7):753-759]. 

The nucleic acid sample which includes the candidate region to be analyzed is 
isolated, amplified and labeled with a reporter group. This reporter group can be a 
fluorescent group, such as phycoerythrin. The labeled nucleic acid is then incubated 
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with the probes immobilized on the chip using a fluidics station. For example, Manz 
et al. (1993) Adv in Chromatogr 1993; 33:1-66 describe the fabrication of fluidics 
devices and particularly microcapillary devices, in silicon and glass substrates. 

Once the reaction is completed, the chip is inserted into a scanner and patterns 
of hybridization are detected. The hybridization data is collected, as a signal emitted 
from the reporter groups already incorporated into the nucleic acid, which is now 
bound to the probes attached to the chip. Since the sequence and position of each 
probe inmiobilized on the chip is known, the identity of the nucleic acid hybridized to 
a given probe can be determined. 

It will be appreciated that when utilized along with automated equipment, the 
above described detection methods can be used to screen multiple samples for ovarian, 
breast or lung cancers both rapidly and easily. 

The presence of SIM2 in an ovarian, breast and/or lung tissue can also be 
determined at the protein level. Numerous protein detection assays are known in the 
art, examples include but are not limited to chromatography, electrophoresis, 
immunodetection assays such as ELISA and westem blot analysis, 
immunohistochemistry and the like, which may be effected using antibodies specific 
to SIM2. Thus, the present invention envisages the use of serum immunoglobulins, 
polyclonal antibodies or fragments thereof, (i.e., immimoreactive derivatives thereof), 
or monoclonal antibodies or fragments thereof for the detection of ovarian, breast 
and/or lung cancer. Monoclonal antibodies or purified fragments of the monoclonal 
antibodies having at least a portion of an antigen-binding region, iiicluding the 
fragments described hereinbelow, chimeric or humanized antibodies and 
compiementarily determining regions (CDR). 

The term "antibody" refers to whole antibody molecules as well as functional 
fragments thereof, such as Fab, F(ab')2, and Fv that are capable of binding with 
antigenic portions of the target polypeptide. These functional antibody fragments 
constitute preferred embodiments of the present invention, and are defined as follows: 

(1) Fab, the fragment which contains a monovalent antigen-binding fragment 
of an antibody molecule, can be produced by digestion of whole antibody with the 
enzyme papain to yield an intact light chain and a portion of one heavy chain; 



(2) Fab', the fragment of an antibody molecule that can be obtained by treating 
whole antibody with pepsin, followed by reduction, to yield an intact light chain and a 
portion of the heavy chain; two Fab* fragments are obtained per antibody molecule; 

(3) (Fab*)2, the fragment of the antibody that can be obtained by treating whole 
5 antibody with the enzyme pepsin without subsequent reduction; F(ab')2 is a dimer of 

two Fab' fragments held together by two disulfide bonds; 

(4) Fv, defined as a genetically engineered fragment containing the variable 
region of the light chain and the variable region of the heavy chain expressed as two 
chains; and 

10 (5) Single chain antibody ("SCA"), a genetically engineered molecule 

containing the variable region of the light chain and the variable region of the heavy 
chain, linked by a suitable polypeptide linker as a genetically frised single chain 
molecule as described in, for example, U.S. Patent 4,946,778. 

SIM2-specific antibodies can be commercially obtained from Santa Cruz 

15 [SIM21 (C-17):sc-8716; SIM2s (C-15):sc-8715]. 

Alternatively, S1M2 specific antibodies may be generated using methods, 
which are well known in the art. See for example, Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York, 1988, incorporated 
herein by reference). 

20 Purification of serum immunoglobulin antibodies (polyclonal antisera) or 

reactive portions thereof can be accomplished by a variety of methods known to those 
of skill including, precipitation by ammonium sulfate or sodium sulfate followed by 
dialysis against saline, ion exchange chromatography, affinity or immurioaffinity 
chromatography as well as gel filtration, zone electrophoresis, etc. (see Coding in, 

25 Monoclonal Antibodies: Principles and Practice, 2nd ed., pp. 104-126, 1986, Orlando, 
Fla., Academic Press). Under normal physiological conditions antibodies are found in 
plasma and other body fluids and in the membrane of certain cells and are produced by 
lymphocytes of the type denoted B cells or their functional equivalent. Antibodies of 
the IgG class are made up of four polypeptide chains linked together by disulfide 

30 bonds. The four chains of intact IgG molecules are two identical heavy chains referred 
to as H-chains and two identical light chains referred to as L-chains. Additional 
classes include IgD, IgE, IgA, IgM arid related proteins. 
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Methods of generating and isolating monoclonal antibodies are well known in 
the art, as summarized for example in reviews such as Tramontano and Schloeder, 
Methods in Enzymology 178, 551-568, 1989. A recombinant SIM2 polypeptide may 
be used to generate antibodies in vitro. More preferably, the recombinant is used to 

5 elicit antibodies in vivo. In general, a suitable host animal is inmiunized with the 
recombinant SIM2. Advantageously, the animal host used is a mouse of an inbred 
strain. Animals are typically immunized with a mixture comprising a solution of the 
recombinant S1M2 in a physiologically acceptable vehicle, and any suitable adjuvant, 
which achieves an enhanced immune response to the immunogen. By way of 

10 example, the primary immunization conveniently may be accomplished with a mixture 
of a solution of the recombinant SIM2 and Freund*s complete adjuvant, said mixture 
being prepared in the form of a water in oil emulsion. Typically the immunization will 
be administered to the aninials intramuscularly, intradermally, subcutaneously, 
intraperitoneally, into the footpads, or by any appropriate route of administration. The 

15 immunization schedule of the immunogen may be adapted as required, but customarily 
involves several subsequent or secondary immunizations using a milder adjuvant such 
as Freund's incomplete adjuvant. Antibody titers and specificity of binding to the 
SIM2 can be determined during the immunization schedule by any convenient method 
including by way of example radioimmunoassay, or enzyme linked immunosorbant 

20 assay, which is known as the ELISA assay. When suitable antibody titers are 
achieved, antibody-producing lymphocytes from the immunized animals are obtained, 
and these are cultured, selected and cloned, as is known in the art. Typically, 
lymphocytes may be obtained in large numbers from the spleens of immunized 
animals, but they niay also be retrieved from the circulation, the lymph nodes or other 

25 lymphoid organs. Lymphocytes are then ftised with any suitable myeloma cell line, to 
yield hybridomas, as is well known in the art. Alternatively, lymphocytes may also be 
stimulated to grow in culture, and may be immortialized by methods known in the art 
including the exposure of these lymphocytes to a virus, a chemical or a nucleic acid 
such as an oncogene, according to established protocols. After fusion, the hybridomas 

30 are cultured under suitable culture conditions, for example in multi-well plates, and the 
culture supematants are screened to identify cultures containing antibodies that 
recognize the hapten of choice. Hybridomas that secrete antibodies that recognize the 
recombinant S1M2 are cloned by limiting dilution and expanded, under appropriate 



culture conditions. Monoclonal antibodies are purified and characterized in terms of 
immunoglobulin type and binding affinity. 

Antibody fragments according to the present invention can be prepared by 
proteolytic hydrolysis of the antibody or by expression in E. coli or mammalian cells 

5 (e.g. Chinese hamster ovary cell culture or other protein expression systems) of DNA 
encoding the fragment. 

Antibody fragments can be obtained by pepsin or papain digestion of whole 
antibodies by conventional methods. For example, antibody fragments can be 
produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment 

10 denoted F(ab*)2. This fragment can be fiirther cleaved using a thiol reducing agent, and 
optionally a blocking group for the sulfliydryl groups resulting from cleavage of 
disulfide linkages, to produce 3.5S Fab' monovalent fragments. Alternatively, an 
enzymatic cleavage using pepsin produces two monovalent Fab' fragments and an Fc 
fi-agment directly. These methods are described, for example, by Goldenberg, in U.S. 

15 Pat. Nos. 4,036,945 and 4,331,647, and references contained therein, which patents are 
hereby incorporated by reference in their entirety (see also Porter, R. R., Biochem. J., 
73 : 1 19-126, 1959), Other methods of cleaving antibodies, such as separation of heavy 
chains to form monovalent light-heavy chain fragments, further cleavage of fragments, 
or other enzymatic, chemical, or genetic techniques may also be used, so long as the 

20 fi*agments bind to the antigen that is recognized by the intact antibody. 

Fv fragments comprise an association of Vh and Vl chains. This association 
may be noncovalent, as described in Inbar et al. (Proc. Nat'l Acad. Sci. USA 69:2659- 
62, 1972). Alternatively, the variable chains can be linked by an intermolecular 
disulfide bond or cross-linked by chemicals such as glutaraldehyde. Preferably, the Fv 

25 fiaginents comprise Vh and Vl chains connected by a peptide linker. These single- 
chain antigen binding proteins (sFv) are prepared by constructing a structural gene 
comprising DNA sequences encoding the Vh and Vl domains connected by an 
oligonucleotide. The structural gene is inserted into an expression vector, which is 
subsequently introduced into a host cell such as E. coli. The recombinant host cells 

30 synthesize a single polypeptide chain with a linker peptide bridging the two V 
domains. Methods for producing sFvs are described, for example, by Whitlow and 
Filpula, Methods, 2: 97-105, 1991; Bird et al.. Science 242:423-426, 1988; Pack et aL, 



Bio/Technology 11:1271-77, 1993; and Ladner et al., U.S. Pat. No. 4,946,778, all of 
which are hereby incorporated, by reference, in entirety. 

Another form of an antibody fragment is a peptide coding for a single 
complementarity-determining region (CDR). CDR peptides ("minimal recognition 

5 'units") can be obtained by constructing genes encoding the CDR of an antibody of 
interest. Such genes are prepared, for example, by using the polymerase chain reaction 
to synthesize the variable region from RNA of antibody-producing cells (see, for 
example, Larrick and Fry Methods, 2: 106-10, 1991), 

Humanized forms of non-human (e.g., murine) antibodies are chimeric 

10 molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as 
Fv, Fab, Fab', F(ab*)2 or other antigen-binding subsequences of antibodies) which 
contain minimal sequence derived from non-human immunoglobulin. Humanized 
antibodies include human immunoglobulins (recipient antibody) in which residues 
form a complementary determining region (CDR) of the recipient are replaced by 

15 residues from a CDR of a non-human species (donor antibody) such as mouse, rat or 
rabbit having the desired specificity, affinity and capacity. In some instances, Fv 
framework residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Humanized antibodies may also comprise residues, which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences. In 

20 general, the humanized antibody will comprise substantially all of at least one, and 
typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the 
FR regions are those of a human immunoglobulin consensus sequence. The humanized 
antibody optimally also will comprise at least a portion of an immunoglobulin constant 

25 region (Fc), typically that of a human immunoglobulin [Jones et al.. Nature, 321:522- 
525 (1986); Riechmann et al.. Nature, 332:323-329 (1988); and Presta, Curr. Op. 
Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-hurnan antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into 

30 it from a source, which is non-human. These non-human amino acid residues are often 
referred to as import residues, which are typically taken from an import variable 
domain. Humanization can be essentially performed following the method of Winter 
and co-workers [Jones et al.. Nature, 321:522-525 (1986); Riechrnann et al,. Nature 
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332:323-327 (1988); Verhoeyen et al.. Science, 239:1534-1536 (1988)], by 
substituting rodent CDRs or CDR sequences for the corresponding sequences of a 
human antibody. Accordingly, such humanized antibodies are chimeric antibodies 
(U.S. Pat. No.. 4,816,567), wherein substantially less than an intact human variable 
domain has been substituted by the corresponding sequence from a non-human 
species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al., J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boemer et al. are also available for the preparation of human monoclonal antibodies 
(Cole et al.. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) 
and Boemer et al., J. Immunol., 147(l):86-95 (1991)], Similarly, human monoclonal 
antibodies can be made by introducing human immunoglobulin loci into transgenic 
animals, e.g., mice in which the endogenous immunoglobulin genes have been 
partially or completely inactivated. Upon challenge, human antibody production is 
observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, for 
example, in U.S. Pat. Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 
10, 779-783 (1992); Lonberg et al.. Nature 368 856-859 (1994); Morrison, Nature 368 
812-13 (1994); Fishwild et al,. Nature Biotechnology 14, 845-51 (1996); Neuberger, 
Nature Biotechnology 14, 826 (1996); Lonberg anjd Muszar, Intem. Rev. Inununol. 13 
65-93 (1995). 

Once the level of SIM2 is determined and the subject diagnosed, the diagnosis 
can be further validated using other diagnostic methods, which rnay also provide an 
accurate staging of the disease in the case of a positive diagnosis. Thus, for example, 
positive diagnosis for ovarian cancer may be confirmed by 
transabdominal/transvaginal ultrasonography and/or a blood test for the tumor marker 
CA-125. Ultrasound screening involves looking for enlarged ovaries aiid a 
transvaginal colour Doppler ultrasound is used to image blood flow. Blood vessel 
formation is thought to discriminate between cancer and benign cysts. Alternatively, 
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positive diagnosis of lung cancer using the method of the present invention can be 
validated using chest X-rays and sputum cytology as well as spiral computed 
tomography (CT) scanning which can detect even very small tumors. Positive 
diagnosis of breast cancer using the method of the present invention can be validated 
using scintimammography, mammography, ultrasound and/or magnetic resonance 
imaging (MRI). 

It will be appreciated that the above-described method of this aspect of the 
present invention may also be used to monitor disease progression and therapeutic 
regimen. 

In addition to diagnostic advances pioneered by the present invention, the 
identification of overexpression of SIM2 in ovarian and lung cancers allows for the 
design of therapeutic agents, which can be used to treat ovarian, breast and lung 
cancers. 

Thus, according to another aspect of the present invention there is provided a 
method of treating a subject (i.e., mammal e.g., human) having ovarian cancer, breast 
2md/or lung cancer. 

As used herein the term "treating" refers to preventing, curing, reversing, 
attenuating, alleviating, minimizing, suppressing or halting the deleterious effects of 
ovarian cancer and/or lung cancer. 

The method according to this aspect of the present invention is effected by 
specifically downregulating expression or activity of SIM2 in a lung tissue, breast 
tissue and/or ovarian tissue to thereby treat the ovarian cancer, breast cancer and/or 
lung cancer in the subject. 

Preferably, the method is effected by providing to the subject a therapeutically 
effective amount of an agent which is capable of downregulating SIM2 expression 
and/or activity. 

As used herein "an agent capable of downregulating SIM2 expression and/or 
activity" refers to a molecule, which is capable of directly or indirectly downregulating 
SIM2 expression or activity. An agent for direct downregulation of SIM2 refers to a 
molecule, which inhibits SIM2 intrinsic activity or expression. An agent for indirect 
downregulation of SIM2 refers to a molecule which inhibits the activity of a SIM2 
effector (e.g., ARNT and HEF-la) or expression thereof. The agents according to this 
aspect of the present invention can be a molecule which binds SIM2 (e.g., an 



antibody); an enzyme which cleaves SIM2; an antisense polynucleotide capable of 
specifically hybridizing with an mRNA transcript encoding SIM2; a SIM2 specific 
aptamer, a ribozyme which specifically cleaves SIM2 transcripts; a non-functional 
analogue of at least a catalytic or binding portion of SIM2 (e.g., a peptide- agent which 

5 can be identified using a phage display technology); a molecule which prevents S1M2 
activation or substrate binding; an siRNA molecule capable of inducing degradation of 
SIM2 transcripts; a DNAzyme which specifically cleaves SIM2 transcripts or DNA, 
an activated double-stranded RNA (dsRNA-dependent protein kinase PKR as 
described in [Shir and Levitzki (2002) Nat, Biotechnol. 20(9):895-900 and Cell Mol 

10 Neurobiol. (2001) 21(6):645-56] and a molecule which is capable of promoting SIM2 
specific immunization response. 

One example, of an agent capable of downregulating a SIM2 is an antibody or 
antibody fragment capable of specifically binding SIM2 or an effector thereof. 
Examples of SIM2 antibodies are described hereinabove. Preferably, such antibodies 

15 are directed at fimctional domains of a target protein.. Thus, for example, a SIM2 
antibody according to this aspect of the present invention is preferably directed at the 
effector binding domain such as the ARNT binding domain. Alternatively, the 
antibody may bind a SIM2 effector such as ARNT or HDF-la. An anti ARNT 
polyclonal rabbit serum raised against residues 1-140 of human ARNT and an anti 

20 HIF-la polyclonal serum raised against residues 786-826 of human HEF-la have been 
described by Susan (2002) J. Biol. Chem. 277:10236-10243. Preferably, the 
antibodies, according to this aspect of the present invention, are humanized such as 
described hereinabove. 

Alternatively an agent capable of downregulating a SIM2 or an effector thereof 

25 can be a protease, which is designed to cleave SIM2. Proteases which can be used to 
cleave SIM2 can be identified by performing a computational analysis such as by 
using the SMART, MEME, MOTIFS, CDD-NCBI, BLOCKS or mPredict software 
each available from http://molbio.info.nih,gov/talks/tools/jobs.html and identifying a 
protease cleavage site, 

30 Alternatively, agents which are designed to inhibit functional domains in the 

SIM2 protein (i.e., protein-protein interaction domains) can be computationally 
identified. For example, various peptide sequences derived from SIM2 can be 
computationally analyzed for an ability to bind ah inhibitor using a variety of three 
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dimensional computational tools. Software programs useful for displaying three- 
dimensional structural models, such as RIBBONS (Carson, M., 1997. Methods in 
Enzymology 277, 25), O (Jones, TA. et a/., 1991. Acta Crystallogr. A47, 110), DINO 
(DINO: Visualizing Structural Biology (2001) http://www.dino3d.org); and 

5 QUANTA, INSIGHT, SYBYL, MACROMODE, ICM, MOLMOL, RASMOL and 
GRASP (reviewed in Kraulis, J., 1991. Appl Crystallogr. 24, 946) can be utilized to 
model interactions between SIM2 and prospective peptide and/or other small 
molecule inhibitors. Computational modeling of protein-peptide interactions has been 
successfully used in rational drug design, for further detail, see Lam et al., 1994. 

10 Science 263, 380; Wlodawer et al., 1993. Ann Rev Biochem. 62, 543; Appelt, 1993. 
Perspectives in Dmg Discovery and Design 1, 23; Erickson, 1993. Perspectives in 
Drug Discovery and Design 1, 109, and Mauro MJ, et al., 2002. J Clin Oncol. 20, 
325-34. Specifically, the PRO^SELECT, tool for the virtual screening of libraries for 
fit to a protein active site, has been used to find novel leads against the serine protease 

15 factor Xa [Liebeschuetz J Med Chem. (2002) ;45(6): 122 1-32]. 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
small interfering RNA (siRNA) molecule. RNA interference is a two-step process, 
the first step, which is termed as the initiation step, input dsRNA is digested into 21-23 
nucleotide (nt) small interfering RNAs (siRNA), probably by the action of Dicer, a 

20 member of the RNase III family of dsRNA-specific ribonucleases, which processes 
(cleaves) dsRNA (introduced directly or via a transgene or a virus) in an ATP- 
dependent manner. Successive cleavage events degrade the RNA to 19-21 bp 
duplexes (siRNA), each with 2-nucleotide 3' overhangs [Hutvagner and Zamore Curr. 
Opin. Genetics and Development 12:225-232 (2002); and Bernstein Nature 409:363- 

25 366 (2001)]. 

In the effector step, the siRNA duplexes bind to a nuclease complex to from 
the RNA-induced silencing complex (RISC). An ATP-dependent unwinding of the 
siRNA duplex is required for activation of the RISC. The active RISC then targets the 
homologous transcript by base pairing interactions and cleaves the mRNA into 12 
30 nucleotide fragments from the 3 ' terminus of the siRNA [Hutvagner and Zamore Curr. 
Opin. Genetics and Development 12:225-232 (2002); Hammond et al. (2001) Nat. 
Rev. Gen. 2:1 10-1 19 (2001); and Sharp Genes. Dev. 15:485-90 (2001)]. Although the 
mechanisrn of cleavage is still to be elucidated, research indicates that each RISC 
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contains a single siRNA and an RNase [Hutvagner and Zamore Curr. Opin. Genetics 

and Development 12:225-232 (2002)]. 

Because of the remarkable potency of RNAi, an amplification step within the 

RNAi pathway has been suggested. Amplification could occur by copying of the input 
5 dsRNAs which would generate more siRNAs, or by replication of the siRNAs formed. 

Alternatively or additionally, amplification could be effected by multiple tumover 

events of the RISC [Hammond et aL Nat. Rev. Gen. 2:110-119 (2001), Sharp Genes. 

Dev. 15:485-90 (2001); Hutvagner and Zamore Curr. Opin. Genetics and 

Development 12:225-232 (2002)], For more information on RNAi see the following 
10 reviews Tuschl ChemBiochem. 2:239-245 (2001); CuUen Nat. Immunol. 3:597-599 

(2002); and Brantl Biochem. Biophys, Act. 1575:15-25 (2002). 

Synthesis of RNAi molecules suitable for use with the present invention can be 

effected as follows. First, the SIM2 mRNA sequence of interest (e.g., SEQ ID NO: 7) 

is scanned downstream of the AUG start codon for AA dinucleotide sequences. 
15 Occurrence of each AA and the 3' adjacent 19 nucleotides is recorded as potential 

siRNA target sites. Preferably, siRNA target sites are selected from the open reading 

frame, as untranslated regions (UTRs) are richer in regulatory protein binding sites. 

UTR-binding proteins and/or translation initiation complexes may interfere with 

binding of the siRNA endonuclease complex [Tuschl ChemBiochem. 2:239-245]. It 
20 will be appreciated though, that siRNAs directed at untranslated regions may also be 

effective, as demonstrated for GAPDH wherein siRNA directed at the 5' UTR 

mediated about 90 % decrease in cellular GAPDH mRNA and completely abolished 

protein level (www.ambion.com/techlib/tn/9 1/91 2.html) . 

Second, potential target sites are compared to an appropriate genomic database 
25 (e.g., human, mouse, rat etc.) using any sequence alignment software, such as the 

BLAST software available from the NCBI server ( www.ncbi.nlm.nih.gov/BLAST/) . 

Putative target sites which exhibit significant homology to other coding sequences are 

filtered out. 

Qualifying target seiquences are selected as template for siRNA synthesis. 
30 Preferred sequences are those including low G/C content as these have proven to be 
iTiofe effective in mediating gene silencing as compared to those with G/C content 
higjier than 55 %. Several target sites are preferably selected along the length of the 
target gene for evaluation. siRNA target sites on SEQ ID NO: 1 are preferably 
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selected from the 5' end i.e., coordinates 1-304 (see Figure 1) e.g., nucleotide 
coordinates 171-193 of SEQ ID NO: 1. For better evaluation of the selected siRNAs, 
a negative control is preferably used in conjunction. Negative control siRNA 
preferably include the same nucleotide composition as the siRNAs but lack significant 
5 homology to the genome. Thus, a scrambled nucleotide sequence of the siRNA is 
preferably used, provided it does not display any significant homology to any other 
gene. 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
DNAzyme molecule capable of specifically cleaving an mRNA transcript or DNA 

10 sequence of the SIM2. DNAzymes are single-stranded polynucleotides which are 
capable of cleaving both single and double stranded target sequences (Breaker, R.R. 
and Joyce, G. Chemistry and Biology 1995;2:655; Santoro, S.W. & Joyce, G.F. Proc. 
Natl, Acad. Sci. USA 1997;943:4262) A general model (the "10-23" model) for the 
DNAzjmie has been proposed. "10-23" DNAzymes have a catalytic domain of 15 

15 deoxyribonucleotides (e.g., nucleotide coordinates 3 1 1-326 of SEQ ID NO: 1), flanked 
by two substrate-recognition domains of seven to nine deoxyribonucleotides each. 
This type of DNAzyme can effectively cleave its substrate RNA at purine:pyrimidine 
junctions (Santoro, S.W. & Joyce, G.F. Proc. Natl, Acad. Sci. USA 199; for rev of 
DNAzymes see Khachigian, LM [Curr Opin Mol Ther 4:1 19-21 (2002)]. 

20 Examples of construction and amplification of synthetic, engineered 

DNAzymes recognizing single and double-stranded target cleavage sites have been 
disclosed in U.S. Pat. No. 6,326,174 to Joyce et al. DNAzymes of similar design 
directed against the human Urokinase receptor were recently observed to inhibit 
Urokinase receptor expression, and successfully inhibit colon cancer cell metastasis in 

25 vivo (Itoh et al , 20002, Abstract 409, Aiin Meeting Am Soc Gen Ther 
wv^w.as gt.org) . In another application, DNAzymes complementary to bcr-abl 
oncogenes were successful in inhibiting the oncogenes expression in leukemia cells, 
and lessening relapse rates in autologous bone marrow transplant in cases of CML and 
ALL. 

30 Downregulation of a SIM2 or an effector thereof can also be effected by using 

an antisense polynucleotide capable of specifically hybridizing with an mRNA 
transcript encoding the SIM2 transcripts. 



Design of antisense molecules, which can be used to efficiently downregulate a 
S1M2 must be effected while considering two aspects important to the antisense 
approach. The first aspect is delivery of the oligonucleotide into the cytoplasm of the 
appropriate cells, while the second aspect is design of an oligonucleotide which 

5 specifically binds the designated mRNA within cells in a way which inhibits 
translation thereof. An example antisense oligonucleotide which can be used in 
accordance with the present invention is designed to hybridize to nucleotide 
coordinates 311-400 of SEQ ID NO: 2. Such an antisense molecule hybridizes to a 
sequence which is shared by all SIM2 expression products known to date and as such 

10 may be useful in silencing SIM2 expression. 

The prior art teaches of a number of delivery strategies which can be used to 
efficiently deliver oligonucleotides into a wide variety of cell types [see, for example, 
Luft J Mol Med 76: 75-6 (1998); Kronenwett et al. Blood 91: 852-62 (1998); Rajur et 
al. Bioconjug Chem 8: 935-40 (1997); Lavigne et al. Biochem Biophys Res Commun 

15 237: 566-71 (1997); and Aoki et al. (1997) Biochem Biophys Res Commun 231: 540- 
5 (1997)]. 

In addition, algorithms for identifying those sequences with the highest 
predicted binding affinity for their target mRNA based on a thermodynamic cycle that 
accounts for the energetics of structural alterations in both the target mRNA and the 
20 oligonucleotide are also available [see, for example, Walton et aL Biotechnol Bioeng 
65: 1-9(1999)]. 

Such algorithms have been successfully used to implement an antisense 
approach in cells. For example, the algorithm developed by Walton et al. enabled 
scientists to successfully design antisense oligonucleotides for rabbit beta-globin 

25 (RBG) and mouse tumor necrosis factor-alpha (TNF alpha) transcripts. The same 
research group has more recently reported that the antisense activity of rationally 
selected oligonucleotides against three model target mRNAs (human lactate 
dehydrogenase A and B and rat gpl30) in cell culture as evaluated by a kinetic PGR 
technique proved effective in almost all cases, including tests against three different 

30 targets in two cell types with phosphodiester and phosphorothioate oligonucleotide 
chemistries; 



In addition, several approaches for designing and predicting efficiency of 
specific oligonucleotides using an in vitro system were also published (Matveeva et 
al.. Nature Biotechnology 16: 1374 - 1375 (1998)]. 

Several clinical trials have demonstrated safety, feasibility and activity of 
5 antisense oligonucleotides. For example, antisense oligonucleotides suitable for the 
treatment of cancer have been successfully used [Holmund et aL, Curr Opin Mol Ther 
1:372-85 (1999)], while treatment of hematological malignancies via antisense 
oligonucleotides targeting c-myb gene, p53 and Bcl-2 had entered clinical trials and 
had been shown to be tolerated by patients [Gerwitz Curr Opin Mol Ther 1:297-306 
10 (1999)]. 

More recently, antisense-mediated suppression of human heparanase gene 
expression has been reported to inhibit pleural dissemination of human cancer cells in 
a mouse model [Uno et al.. Cancer Res 61:7855-60 (2001)]. 

Thus, the current consensus is that recent developments in the field of antisense 

15 technology which, as described above, have led to the generation of highly accurate 
antisense design algorithms and a wide variety of oligonucleotide delivery systems, 
enable an ordinarily skilled artisan to design and implement antisense approaches 
suitable for downregulating expression of known sequences without having to resort to 
undue trial and error experimentation. 

20 It will be appreciated that antisense oligonucleotides can be used to modulate 

alternative splicing from SIM2 gene [Suzani and Kole (2003) Progress in Molecular 
and Subcellular Biology vol. 31 Philippe Jeanteur (Ed.) Springer-Verlag Berlin 
Heidelberg]. Inhibition of splicing by antisense oligonucleotides can be accomplished 
by targeting oligonucleotides to small nuclear RNAs (snRNAs), which participate in 

25 spliceosome formation and are essential for splicing and to splice sites and adjacent 
sequences [reviewed in Kole (1991) Adv. Dmg Delivery Rev. 6:271-286], 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
ribozyme molecule capable of specifically cleaving an mRNA transcript encoding a 
SIM2 for example. Ribozymes are being increasingly used for the sequence-specific 

30 inhibition of gene expression by the cleavage pf mRNAs encoding proteins of interest 
[Welch et aL, Curr Opin Biotechnol. 9:486-96 (1998)]. The possibility of designing 
ribozymes to cleave any specific target RNA has rendered them valuable tools in both 
basic research and therapeutic applications. In the therapeutics area, ribozymes have 
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been exploited to target viral RNAs in infectious diseases, dominant oncogenes in 
cancers and specific somatic mutations in genetic disorders [Welch et al., Clin Diagn 
Virol. 10:163-71 (1998)]. Most notably, several ribozyme gene therapy protocols for 
HIV patients are already in Phase 1 trials. More recently, ribozymes have been used 
for transgenic animal research, gene target validation and pathway elucidation. 
Several ribozymes are in various stages of clinical trials. ANGIOZYME was the first 
chemically synthesized ribozyme to be studied in human clinical trials. 
ANGIOZYME specifically inhibits formation of the VEGF-r (Vascular Endothelial 
Growth Factor receptor), a key component in the angiogenesis pathway. Ribozyme 
Pharmaceuticals, Inc., as well as other firms have demonstrated the importance of anti- 
angiogenesis therapeutics in animal models. HEPTAZYME, a ribozyme designed to 
selectively destroy Hepatitis C Virus (HCV) RNA, was found effective in decreasing 
Hepatitis C viral RNA in cell culture assays (Ribozyme Pharmaceuticals, Incorporated 
- WEB home page). 

Alternatively, the agent can be a molecule, which promotes a SIM2-specific 
immunogenic response in the subject. The molecule can be a SIM2 protein, a 
fi-agment derived therefi*om or a nucleic acid sequence encoding thereof. Although 
such a molecule can be provided to the subject per se, the agent is preferably 
administered with an immunostimulant in an immunogenic composiiton. An 
inununostimulant may be any substance that enhances or potentiates an immune 
response (antibody and/or cell-mediated) to an exogenous antigen. Examples of 
immunostimulants include adjuvants, biodegradable microspheres (e.g., polylactic 
galactide) and liposomes into which the compound is incorporated (see e.g., U.S. Pat. 
No. 4,235,877). Vaccine preparation is generally described in, for example, M. F. 
Powell arid M. J. Newman, eds., "Vaccine Design (the subunit and adjuvant 
approach)," Plenum Press (NY, 1995). 

Illustrative immunogenic compositions may contain DNA encoding one or 
more of the SIM2 polypeptides as described above, such that the polypeptide is 
generated in situ. The DNA may be present within any of a variety of delivery 
systems known to those of ordinary skill in the art, including nucleic acid expression 
systems (see below), bacteria and viral expression systems.. Numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 15:143-198, 1998, and references cited therein. 



Appropriate nucleic acid expression systems contain the necessary DNA sequences 
for expression in the subject (such as a suitable promoter and terminating signal). 
Bacterial delivery systems involve the administration of a bacterium (such as 
Bacillus-Calmette-Guerrin) that expresses an inmiunogenic portion of the polypeptide 

5 on its cell surface or secretes such an epitope. In a preferred embodiment, the DNA 
may be introduced using a viral expression system (e.g., vaccinia or other pox virus, 
retrovirus, or adenovims), which may involve the use of a non-pathogenic (defective), 
replication competent virus. Suitable systems are disclosed, for example, in Fisher- 
Hoch et al., Proc. Natl. Acad. Sci. USA 86:317-321, 1989; Flexner et al., Ann. N.Y 

10 Acad. Sci, 569:86-103, 1989; Flexner et al.. Vaccine 8:17-21, 1990; U.S. Pat Nos. 
4,603,112, 4,769,330, and 5,017,487; WO 89/01973; U.S. Pat. No. 4,777,127; GB 
2,200,651; EP 0,345,242; WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; 
Rosenfeld et al.. Science 252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 
91:215-219, 1994; Kass-Eisler et al., Proc. Natl. Acad. Sci. USA 90:11498-11502, 

15 1993; Guzman et al. Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res. 
73:1202-1207, 1993. Techniques for incorporating DNA into such expression systems 
are well known to those of ordinary skill in the art. The DNA may also be "naked," as 
described, for example, in Ulmer et al.. Science 259:1745-1749, 1993 and reviewed 
by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be 

20 increased by coating the DNA onto biodegradable beads, which are efficiently 
transported into the cells. 

It will be appreciated that an immunogenic composition may comprise both a 
polynucleotide and a polypeptide component. Such immunogenic compositions may 
provide for ari enhanced immune response. 

25 Any of a variety of immunostimulants may be employed in the immunogenic 

compositions of this invention. For example, an adjuvant may be included. Most 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, 
such as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 

30 Suitable adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.); Merck 
Adjuvant 65 (Merck and Compahy, Inc., Rahway, N.J.); AS-2 (SmithKline Beecham, 
Philadelphia, Pa.); aluminum salts such as aluminum hydroxide gel (alum) or 
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aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 
acylated tyrosine; acylated sugars; cationically or anionically derivatized 
polysaccharides; polyphosphazenes; biodegradable microspheres; moriophosphoryl 
lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2,-7, or -12, may also 

5 be used as adjuvants. 

The adjuvant composition may be designed to induce an immune response 
predominantly of the Thl type. High levels of Th I -type cytokines (e.g., IFN-.gamma., 
TNF.alpha., IL-2 and IL-12) tend to favor the induction of cell mediated immune 
responses to an administered antigen. Li contrast, high levels of Th2-type cytokines 

10 (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral immune 
responses. Following application of an immunogenic composition as provided herein, 
the subject will support an immune response that includes Thl- and Th2-type 
responses. The levels of these cytokines may be readily assessed using standard 
assays. For a review of the families of cytokines, see Mosmann and Coffinan, Ann. 

15 Rev. Immunol. 7:145-173, 1989. 

Preferred adjuvants for use in eliciting a predominantly Thl -type response 
include, for example, a combination of monophosphoryl lipid A, preferably 3-de-O- 
acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL 
adjuvants are available from Corixa Corporation (Seattle, Wash.; see U.S. Pat. Nos. 

20 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides 
(in which the CpG dinucleotide is unmethylated) also induce a predominantly Thl 
response. Such oligonucleotides are well known and are described, for example, in 
WO 96/02555, WO 99/33488 and U.S. Pat. Nos. 6,008,200 and 5,856,462. 
Immunostimulatory DNA sequences are also described, for example, by Sato et al., 

25 Science 273:352, 1996. Another preferred adjuviant is a saponin, preferably QS21 
(Aquila Biopharmaceuticals Inc., Framingham, Mass.), which may be used alone or in 
combination \yith other adjuvants. For example, an enhanced system involves the 
combination of a monophosphoryl lipid A and saponin derivative, such as the 
combination of QS21 and 3D-MPL as described in WO 94/00153, or a less 

30 reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. A particularly potent adjuvant formulation involving QS21, 3D-MPL and 
tocopherol in an oil-in-water emulsion is described in WO 95/1 7210. 



37 

Other preferred adjuvants include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, Calif, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of 
adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Corixa, Hamilton, Mont.), RC-529 (Corixa, Hamilton, Mont.) and 

5 other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in 
pending U.S. patent application Ser. Nos. 08/853,826 and 09/074,720. 

A delivery vehicle may be employed within the immunogenic composition of 
the present invention to facilitate production of an antigen-specific immune response 
that targets tumor cells. Delivery vehicles include antigen presenting cells (APCs), 

10 such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may be genetically modified to increase 
the capacity for presenting the antigen, to improve activation and/or maintenance of 
the T cell response, to have anti-tumor effects per se and/or to be immunologically 
compatible with the receiver (i.e., matched HLA haplotype). APCs may generally be 

15 isolated from any of a variety of biological fluids and organs, including tumor and 
peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic 
cells. 

Dendritic cells are highly potent APCs (Banchereau and Steinman, Nature 
392:245-251, 1998) and have been shown to be effective as a physiological adjuvant 

20 for eliciting prophylactic or therapeutic antitumor immunity (see Timmeman and 
Levy, Ann. Rev. Med, 50:507-529, 1999). In general, dendritic cells may be identified 
based on their typical shape (stellate in situ, with marked cytoplasmic processes 
(dendrites) visible in vitro), their ability to take up, process and present antigens with 
high efficiency and their ability to activate naive T cell responses. Dendritic cells 

25 may, of course, be engineered to express specific cell-surface receptors or ligands that 
are hot commonly found on dendritic cells in vivo or ex vivo, and such modified 
dendritic cells are contemplated by the present invention. As an altemative to 
dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may 
be used within aii immunogenic composition (see Zitvogel et al., Nature Med. 4:594- 

30 600, 1998). 

Dendritic cells and progenitors niay be obtained from peripheral blood, bone 
marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, 
spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For exaniple. 
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dendritic cells may be differentiated ex vivo by adding a combination of cytokines 
such as GM-CSF, IL-4, IL-13 and/or TNF.alpha. to cultures of monocytes harvested 
from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral 
blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells 

5 by adding to the culture medium combinations of GM-CSF, IL-3, TNF.alpha., CD40 
ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are categorized as "immature" and "mature" cells, which allows 
a simple way to discriminate between two well characterized phenotypes. Immature 

10 dendritic cells are characterized as APC with a high capacity for antigen uptake and 
processing, which correlates with the high expression of Fey receptor and mannose 
receptor. The mature phenotype is typically characterized by a lower expression of 
these markers, but a high expression of cell surface molecules responsible for T cell 
activation such as class I and class II MHC, adhesion molecules (e.g., CD54 and 

15 GDI 1) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-lBB). 

APCs may generally be transfected with a polynucleotide encoding SIM2, 
such that SIM2, or an immunogenic portion thereof, is expressed on the cell surface. 
Such transfection may take place ex vivo, and a composition comprising such 
transfected cells may then be used for therapeutic purposes, as described herein. 

20 Alternatively, a gene delivery vehicle that targets a dendritic or other antigen 
presenting cell may be administered to the subject, resulting in transfection that 
occurs in vivo. In vivo and ex vivo transfection of dendritic cells, for example, may 
generally be performed using any methods known in the art, such as those described 
in WO 97/24447, or the gene gun approach described by Mahvi et al.. Immunology 

25 and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may be 
achieved by incubating dendritic cells or progenitor cells with the SIM2 polypeptide, 
DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing 
recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or ientivifils 
vectors). Prior to loading, the polypeptide may be covalently conjugated to an 

30 immunological partner that provides T cell help (e.g., a carrier molecule) such as 
described above. Altematively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 
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Agents for downregulating expression or activity of SIM2 (i.e., active 
ingredients) of the present invention can be provided to the subject per se^ or as part of 
a pharmaceutical composition where they are mixed with a pharmaceutically 
acceptable carrier. 

5 As used herein a "pharmaceutical composition" refers to a preparation of one 

or more of the active ingredients described herein with other chemical components 
such as physiologically suitable carriers and excipients. The purpose of a 
pharmaceutical composition is to facilitate administration of a compound to an 
organism. 

10 Herein the term "active ingredient" refers to the preparation accountable for the 

biological effect. 

Hereinafter, the phrases "physiologically acceptable carrier" and 
"pharmaceutically acceptable carrier" which may be interchangeably used refer to a 
carrier or a diluent that does not cause significant irritation to an organism and does 

15 not abrogate the biological activity and properties of the administered compound. An 
adjuvant is included under these phrases. One of the ingredients included in the 
pharmaceutically acceptable carrier can be for example polyethylene glycol (PEG), a 
biocompatible polymer with a wide range of solubility in both organic and aqueous 
media (Mutter et al. (1979). 

20 Herein the term "excipient" refers to an inert substance added to a 

pharmaceutical composition to further facilitate administration of an active ingredient. 
Examples, without limitation, of excipients include calcium carbonate, calcium 
phosphate, various sugars and types of starch, cellulose derivatives, gelatin, vegetable 
oils and polyethylene glycols. 

25 Techniques for formulation and administration of drugs may be found in 

"Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition, which is incorporated herein by reference. 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, especially transnasal, intestinal or parenteral delivery, including 

30 intramuscular, subcutaneous and intramedullary injections as well as intrathecal, direct 
intraventricular, intravenous, inrtaperitoneal, ititranasal, or intraocular injections. 
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Alternately, one may administer a preparation in a local rather than systemic 
manner, for example, via injection of the preparation directly into a specific region of 
a patient's body. 

Pharmaceutical compositions of the present invention may be manufactured by 
processes well known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present invention 
may be formulated in conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries, which facilitate processing 
of the active ingredients into preparations which, can be used pharmaceutically. 
Proper formulation is dependent upon the route of administration chosen. 

For injection, the active ingredients of the invention may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hank's 
solution. Ringer's solution, or physiological salt buffer. For transmucosal 
administration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. 

For oral administration, the compounds can be formulated readily by 
coinbining the active compounds with pharmaceutically acceptable carriers well 
known in the art. Such carriers enable the compounds of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for oral ingestion by a patient. Pharmacological 
preparations for oral use can be made using a solid excipient, optionally grinding the 
resulting mixture, and processing the mixture of granules, after adding suitable 
auxiliaries if desired, to obtain tablets or dragee, cores. Suitable excipients are, in 
particular, fillers such as sugars, including lactose, sucrose, maimitol, or sorbitol; 
cellulose preparations such eis, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl- 
cellulose, sodium carbomethylcellulose; and/or physiologically acceptable polymers 
such as polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, 
such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such 
as sodium alginate. 



Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide, 
lacquer solutions and suitable organic solvents or solvent mixtures. Dyestuffs or 

5 pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compoxmd doses. 

Pharmaceutical compositions, which can be used orally, include push-fit 
capsules made of gelatin as well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules may contain the active 

10 ingredients in admixture with filler such as lactose, binders such as starches, lubricants 
such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the 
active ingredients may be dissolved or suspended in suitable liquids, such as fatty oils, 
liquid paraffin, or liquid polyethylene glycols, hi addition, stabilizers may be added. 
All formulations for oral administration should be in dosages suitable for the chosen 

15 route of administration. 

For buccal administration, the compositions may take the form of tablets or 
lozenges formulated in conventional maimer. 

For administration by nasal inhalation, the active ingredients for use according 
to the present invention are conveniently delivered in the form of an aerosol spray 

20 presentation from a pressurized pack or a nebulizer with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichloro- 
tetrafluoroethane or carbon dioxide. Li the case of a pressurized aerosol, the dosage 
unit may be determined by providing a valve to deliver a metered amount. Capsules 
and cartridges of, e.g., gelatin for use in a dispenser may be formulated containing a 

25 powder mix of the compound and a suitable powder base such as lactose or starch. 

The preparations described herein may be formulated for parenteral 
administration, e.g., by bolus injection or continuous inftisioti. Formulations for 
injection may be presented in unit dosage form, e;g., in ampoules or iri multidose 
containers with optionally, an added preservative. The compositioiis may be 

30 suspensions, solutions or ernulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical compositions for parenteral administration include aqueous 
solutions of the active preparation in water-soluble form. Additionally, suspensions of 



the active ingredients may be prepared as appropriate oily or water based injection 
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame 
oil, or synthetic fatty acids esters such as ethyl oleate, triglycerides or liposomes. 
Aqueous injection suspensions may contain substances, which increase the viscosity of 

5 the suspension, such as sodium carboxymethyl cellulose, sorbitol or dextran. 
Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the active ingredients to allow for the preparation of highly 
concentrated solutions. 

Alternatively, the active ingredient may be in powder form for constitution 

10 with a suitable vehicle, e.g., sterile, pyrogen-free water based solution, before use. 

The preparation of the present invention may also be formulated in rectal 
compositions such as suppositories or retention enemas, using, e.g., conventional 
suppository bases such as cocoa butter or other glycerides. 

Pharmaceutical compositions suitable for use in context of the present 

15 invention include compositions wherein the active ingredients are contained in an 
amount effective to achieve the intended purpose. More specifically, a therapeutically 
effective amount means an amount of active ingredients effective to prevent, alleviate 
or ameliorate symptoms of disease or prolong the survival of the subject being treated. 
Determination of a therapeutically effective amount is well within the 

20 capability of those skilled in the art. 

For any preparation used in the methods of the invention, the therapeutically 
effective amount or dose can be estimated initially froni in vitro assays. For example, 
a dose can be formulated in animal models and such information can be used to more 
accurately determine useful doses in humaiis. 

25 Toxicity and therapeutic efficacy of the active ingredients described herein can 

be determined by standard pharmaceutical procedures in vitro j in cell cultures or 
experimental animals. The data obtained from these in vitro and cell culture assays 
and animal studies can be used in formulating a range of dosage for use in humaii. 
The dosage may vary depending upon the dosage form employed and the route of 

30 administration utilized. The exact formulation, route of administration and dosage can 
be chosen by the individual physician in view of the patienfs condition. (See e.g., 
Fingl, et al., 1975, in "The Phairmacological Basis of Therapeutics", Ch. 1 p.l ). 
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Depending on the severity and responsiveness of the condition to be treated, 
dosing can be of a single or a plurality of administrations, with course of treatment 
lasting from several days to several weeks or until cure is effected or diminution of the 
disease state is achieved. 

5 The amount of a composition to be administered will, of course, be dependent 

on the subject being treated, the severity of the affliction, the manner of 
administration, the judgment of the prescribing physician, etc. 

Compositions including the preparation of the present invention formulated in 
a compatible pharmaceutical carrier may also be prepared, placed in an appropriate 

10 container, and labeled for treatment of an indicated condition. 

Pharmaceutical compositions of the present invention may, if desired, be 
presented in a pack or dispenser device, such as an FDA approved kit, which may 
contain one or more unit dosage forms containing the active ingredient. The pack 
may, for example, comprise metal or plastic foil, such as a blister pack. The pack or 

15 dispenser device may be accompanied by instructions for administration. The pack or 
dispenser may also be accommodated by a notice associated with the container in a 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the form of 
the compositions or human or veterinary administration. Such notice, for example, 

20 may be of labeling approved by the U.S. Food and Drug Administration for 
prescription drugs or of an approved product insert. 

It will be appreciated that the oligonucleotides of the present invention can also 
be expressed from a nucleic acid construct, which can be administered to the subject 
employing any suitable mode of administration, described hereinabove (e.g., in- vivo 

25 gene therapy). Such a nucleic acid construct is introduced into a target cell or cells via 
appropriate gene delivery vehicle/methods (transfection, transduction, homologous 
recombination, etc.) and an expression system as needed and then the modified cells 
are expanded in culture and retumed to the subject (i.e., ex-vivo gene therapy). 

Such expression constructs may include a tissue-specific promoter for directing 

30 expression of the dowiiregulating agents in the malignant tissue. Thus an ovarian 
specific promoter such as OSP-1 [Kumaran Cancer Res. (2001) Feb 1 5;6 1(4): 1291- 
5]and IAI.3B [Hamada Cancer Res. (2003) May 15;63(10):2506-12] itiay be used. A 
breast-specific promoter which may be used in accordance with the present invention 
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includes rNRL, Muc-1 and mWAP [Hiripi (2003) DNA Cell Biol. 22:41-5; Berger 
(2001) Breast Cancer Res. 3:28-35]. Alternatively a lung specific promoter such as 
CC-10 [Harrod Am J Respir Cell Mol Biol. (2002) Feb;26(2):216-23]and SP-C [Duan 
Oncogene. (2002) Nov 7;21(51):7831-8]may be used 

5 Expression of duplex oHgonucleotides is preferably effected via expression 

vectors specifically designed for such use. For example, the pSUPER™ including the 
polymerase-III Hl-RNA gene promoter with a well defined start of transcription and a 
termination signal consisting of five thymidines in a row (T5) [Brummelkamp (2002) 
Science 296:550-53]. Another suitable siRNA expression vector encodes the sense 

10 and antisense siRNA imder the regulation of separate polIII promoters [Miyagishi and 
Taira[(2002) Nature Biotech. 20:497-500]. The resultant siRNA includes 5 thymidine 
termination signal. Alternatively, oligonucleotide sequences can be placed under bi- 
directional promoters to produce both the sense and antisense transcripts fi*om the 
same promoter construct, thus simplifying the construction of expression vectors and 

15 achieving an equal molar ratio of cellular sense and antisense sequences. Examples 
for bi-directional promoters are disclosed in U.S. Pat. AppL No. 20020108142. 
It will be appreciated that when duplex oligonucleotide are used, transfection reagents 
dedicated to siRNA transfer to mammalian cells are preferably employed. Examples 
for such include but are not limited to siPORT™ Amine (i.e., a polyamine mixture) 

20 and siPORT Lipid (i.e., a mixture of cationic and neutral lipids). 

Accordingly, in cases where the duplex oligonucleotides of the present 
invention are introduced into a cell in which RNA interference (RNAi) does not 
normally occur, the factors needed to mediate RNAi are introduced into such a cell or 
the expression of the needed factors is induced, as disclosed in U.S. Pat. AppL No.: 

25 20020086356. 

It will be appreciated that treatment of subjects exhibiting mutated SIM2 
transcripts may also be effected using a "knock in" strategy (see U,S. Pat. No. 
6,265,632), wherein endogenous SIM2 sequence alterations are corrected using 
advanced gene therapy. 

30 As is mentioned hereinabove, the present inventors uncovered novel iso forms 

ofSIM-2. 

Thus, according to another aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence encoding a polypeptide 



being at least 60 %, at least 65 %, at least 70 %, at least 75 %, at least 80 %, at least 82 
%, at least 86 %, at least 88 %, at least 90 %, at least 92 %, at least 94 % or more, say 
95 % - 100 % homologous to SEQ ID NO: 39, as determined using the BestFit 
software of the Wisconsin sequence analysis package, utilizing the Smith and 
5 Waterman algorithm, where the gap creation equals 8 and gap extension penalty 
equals 2. 

According to one preferred embodiment of this aspect of the present invention 
the isolated polynucleotide is at least 60 %, at least 65 %, at least 70 %, at least 75 %, 
at least 80 %, at least 82 %, at least 86 %, at least 88 %, at least 90 %, at least 92 %, at 

10 least 94 % or more, say 95 % - 100 % identical to SEQ ID NO: 39, as determined 
using BestFit software of the Wisconsin sequence analysis package, utilizing the 
Smith and Waterman algorithm, where gap weight equals 50, length weight equals 3, 
average match equals 10 and average mismatch equals -9. 

According to another preferred embodiment of this aspect of the present 

15 invention the isolated polynucleotide is as set forth in SEQ ID NO: 2. 

According to yet another aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence encoding a polypeptide 
being at least 50 %, at least 5 %, at least 60 %, at least 65 %, at least 70 %, at least 75 
%, at least 80 %, at least 82 %, at least 86 %, at least 88 %, at least 90 %, at least 92 

20 %, at least 94 % or more, say 95 % - 100 % homologous to SEQ ID NOs: 40 or 41, as 
determined using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where the gap creation equals 8 and gap 
extension penalty equals 2. 

According to one preferred embodiment of this aspect of the present inveiition 

25 the isolated polynucleotide is at least 50 %, at least 5 %, at least 60 %, at least 65 %, at 
least 70 %, at least 75 %, at least 80 %, at least 82 %, at least 86 %, at least 88 %, at 
least 90 %, at least 92 %, at least 94 % or more, say 95 % - 100 % identical to SEQ ID 
NOs: 40 or 41, as detemiined using BestFit software of the Wisconsin sequence 
analysis package, utilizing the Smith and Waterman algorithm, where gap weight 

30 equals 50, length weight equals 3, average match equals 10 and average mismatch 
equals -9. 

According to another preferred embodiment of this aspect of the present 
invention the isolated polynucleotide is as set forth in SEQ ID NO: 3. 
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As used herein the phrase "an isolated polynucleotide" refers to a single or 
double stranded nucleic acid sequences which is isolated and provided in the form of 
an RNA sequence, a complementary polynucleotide sequence (cDNA), a genomic 
polynucleotide sequence and/or a composite polynucleotide sequences (e.g., a 

5 combination of the above). 

As used herein the phrase "complementary polynucleotide sequence" refers to 
a sequence, which results from reverse transcription of messenger RNA using a 
reverse transcriptase or any other RNA dependent DNA polymerase. Such a sequence 
can be subsequently amplified in vivo or in vitro using a DNA dependent DNA 

10 polymerase. 

As used herein the phrase "genomic polynucleotide sequence" refers to a 
sequence derived (isolated) from a chromosome and thus it represents a contiguous 
portion of a chromosome. 

As used herein the phrase "composite polynucleotide sequence" refers to a 

15 sequence, which is at least partially complementary and at least partially genomic. A 
composite sequence can include some exonal sequences required to encode the 
polypeptide of the present invention, as well as some intronic sequences interposing 
therebetween. The intronic sequences can be of any source, including of other genes, 
and typically will include conserved splicing signal sequences. Such intronic 

20 sequences may further include cis acting expression regulatory elements. 

Since the polynucleotide sequences of the present invention encode previously 
unidentified polypeptides, the present invention . also encompasses isolated 
polypeptides or portions thereof which are encoded by the isolated polynucleotide 
which are described hereinabove. 

25 Thus, this aspect of the present invention also encompasses polypeptides which 

are set forth in SEQ ID NO: 39, 40 or 41, homologues thereof (selected from the 
homology range oiF 60-100 % described hereinabove) fragments thereof and altered 
polypeptides characterized by mutations, such as deletion, insertion or substitution of 
one or more amino acids, either naturally occurring or man induced, either randomly 

30 or in a targeted fashion. 

Since expression of SIM2 is correlatable with cancer development (see WO 
02/12565) the present invention also envisages the use of the novel sequences in 
diagnosis and treatment of cancer. Examples include, but are not limited to bone 
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cancers, brain tumors, breast cancer, endocrine system cancers, gastrointestinal 
cancers, gynecological cancers, head and neck cancers, leukemia, lymphomas, 
metastases, myelomas, pediatric cancers, penile cancer, prostate cancer, sarcomas, skin 
cancers, thyroid cancer, thyoma, urinary tract cancers, carcinoma of unknown primary 
and Li-Fraumeni syndrome. 

As is illustrated in the Examples section which follows, the present inventors 
have shown through laborious experimentation that these sequences are differentially 
expressed in colon adenocarcinoma, in lung adenocarcinoma and in lung squamous 
cell carcinoma supporting the use of such sequences in diagnosis and treatment of 
cancer as described above. 

Additional objects, advantages, and novel features of the present invention will 
become apparent to one ordinarily skilled in the art upon examination of the following 
examples, which are not intended to be limiting. Additionally, each of the various 
15 embodiments and aspects of the present invention as delineated hereinabove and as 
claimed in the claims section below finds experimental support in the following 
examples. 

EXAMPLES 

20 Reference is now made to the following examples, which together with the 

above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures utilized 
in the present invention include molecular, biochemical, microbiological and 
recombinant DNA techniques. Such techniques are thoroughly explained in the 

25 Uterature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et 
al., (1989); "Current Protocols in Molecular Biology" Volumes Mil Ausubel, R. M., 
ed. (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and 
Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", 
John Wiley & Sons, New York (1988); Watson et aL, "Recombinant DNA", Scientific 

30 American Books, New York; Birren et aL (eds) "Genome Analysis: A Laboratory 
Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); 
methodologies as set forth in U.S. Pat, Nos. 4,666,828; 4,683,202; 4,801,531; 
5,192,659 arid 5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-III 
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Cellis, J. E., ed. (1994); "Current Protocols in Immunology" Volumes I-III Coligan J, 
E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" (8th Edition), 
Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), "Selected Methods 
in Cellular Immunology", W. H. Freeman and Co., New York (1980); available 

5 immunoassays are extensively described in the patent and scientific literature, see, for 
example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 
3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 
4,098,876; 4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. 
J., ed. (1984); *TSfucleic Acid Hybridization" Hames, B. D., and Higgins S. J., eds. 

10 (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., Eds. (1984); 
"Animal Cell Culture" Freshney, R. L, ed. (1986); "Immobilized Cells and Enzymes" 
IRL Press, (1986); "A Practical Guide to Molecular Cloning" Perbal, B., (1984) and 
"Methods in Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To 
Methods And Applications", Academic Press, San Diego, CA (1990); Marshak et al., 

15 "Strategies for Protein Purification and Characterization - A Laboratory Course 
Manual" CSHL Press (1996); all of which are incorporated by reference as if fiiUy set 
forth herein. Other general references are provided throughout this document. The 
procedures therein are believed to be well known in the art and are provided for the 
convenience of the reader. All the information contained therein is incorporated 

20 herein by reference. 

EXAMPLE 1 
Genomic organization of SIM2 
Schematic presentation of SIM2 lung specific transcripts is shown in Figure 1. 
25 Specifically, the genomic alignment of the exons of SEQ ID NOs: 1-3, 7 and 8 is 
shown. The genomic sequence used as a reference is chromosome 21q22,13, starting 
at position 36992386 and terminating at position 37042613. Genomic sequences are 
available at UCSC Genome Bioinformatics database, release version April 2002 
(http://gen6me.ucsc,edu) . 
30 Table 1, below shows the coordinates of the exons on the genomic sequence 

for each SEQ ID NO, correspondingly. 
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Table 1 



SEQIDNO: 


Exon coordinates on 


Relative exon coordinates on 




expressed sequence 


gen am ic sequen ce 


1 


l-lOOl 


12073-13073 


2 


1-312 


12073-12384 


2 


313-404 


13843-13932 


2 


405-513 


21132-21240 


2 


514-599 


24356-24441 


2 


600-621 


27430-27451 


3 


1-517 


1001-1517 


3 


518-1766 


1930-3178 


3 


1767-3143 


5613-6989 


3 


3144-3473 


7432-7761 


3 


3474-3640 


8343-8509 


3 


3541-3895 


9165-9419 


3 


3896-4159 


9589-9852 


3 


4160-4644 


10281-10765 


3 


4645-8296 


11543-15194 


7 


1-231 


1001-1231 


7 


232-314 


10478-10560 


7 


315-404 


13843-13932 


7 


405-513 


21132-21240 


7 


514-599 


24356-24441 


7 


600-799 


27430-27629 


7 


800-906 


32356-32462 


7 


907-1054 


43028-43175 


7 


1055-1223 


44698-44866 


7 


1224-1632 


46039-46447 


7 


1633-3885 


48976-51228 


8 


1-231 


1001-1231 


8 


232-314 


10478-10560 


8 


315-404 


13843-13932 


8 


405-513 


21132-21240 


8 


514-599 


24356-24441 


8 


600-799 


27430-27629 


8 


800-906 


32356-32462 


8 


907-1054 


43028-43175 


8 


1055-1223 


44698-44866 


8 


1224-2823 


46039-46447 



EXAMPLE 2 
Materials and Exprerimental Procedures 
5 RNA preparation - RNA was commercially obtained from Clontech 

(Franklin Lakes, NJ USA 07417, w^vw.clontech.com) or BioChain Inst. Inc. 
(vrvvw^biochain.com) or ABS or Clinomics. Alternatively RNA was purified fi*om 
tissue samples using TRI-Reagent (Molecular Research Center), according to 
Manufacturer's instructions. Tissue samples were obtained from subjects or fi-om 
10 postmortem. Total RNA samples were treated with DNasel (Ambion) then purified 
using RNeasy columns (Qiagen). 
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Lung — Lu_lN-Lu_5N sample was prepared from commercially available 
normal lung RNAs (BioChain, Cat. No. CDP-061010, Lot Nos: A503205, A503384, 
A503385, A503204, A503206). RT Lu_6N was prepared from a pool of 6 normal lung 
RNAs (BioChain, Cat. No. CDP-061010, Lot No. A409363). 

5 Lu 7C- Lu 16C samples were prepared from lung adenocarcinoma RNAs: 

samples Lu_7C- Lu_10C were prepared from commercially available RNAs 
(BioChain, Cat. No. CDP-064004A, Lot Nos: A504117, A504119, A504116, 
A504118), and samples Lu_llC- Lu_16C were prepared from RNA purified from 
tissue samples of subjects. 

10 RTs Lu_17C- Lu_31C were prepared from squamous cell carcinoma RNAs: 

samples Lu_17C- Lu 22C were prepared from commercial RNAs (BioChain, Cat. No. 
CDP-064004B, Lot Nos: A503187, A503386, A503387, A503183, A411075), sample 
Lu_27C was prepared from commercial RNA (Clontech, Cat No: 64013-1), samples 
Lu_28C - Lu_30C were prepared from commercial RNAs (BioChain, Cat. No. CDP- 

15 064004, Lot Nos: A409017, A409091, A408175), and samples Lu_23C- Lu_26C and 
Lu_31C were prepared from RNA purified from subjects tissue samples. 

Lu_32C- Lu_35C were prepared from commercial small cell carcinoma RNAs 
(BioChain, Cat. No. CDP-064004D, Lot Nos: A504115, A501390, A501389, 
A501391). 

20 Lu_36C- Lu_37C were prepared from commercial large cell carcinoma RNAs 

(BioChain, Cat. No. CDP-064004C, Lot Nos: A5041 13, A5041 14). 

Lu_38C was prepared from commercial alveolus cell carcinoma RNAs 
(BioChain, Cat. No. CDP-064004, Lot Nos: A409089). 

RT Lu_39C was prepared from lung carcinoma RNA purified from subject 
25 tissue sample (with no forther subcaracterization). 

Lu_40 H1299 was prepared from RNA purified from NCI_H1299 cell line 
non-small cell carcinoma (ATCC Catalog No: CRL-5803). 

SG_41 was prepared from commercial normal salivary gland RNA (pool of 24) 
(Clontech, Cat No: 641 10-1). 
30 Lu_16N and Lu_25N were prepared from RNA purified from subjects normal 

tissue samples matched to the cancer samples Lu_16C and Lu_25C. 

Colon - RTs marked as "Col_xN'* (Col__2N- Col_276N) were prepared from 
normal colon RNAs. Samples Col_22N and Col_23N were prepared from commercial 
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RNAs (BioChain, Cat. No. CDP-064007, Lot Nos: A501132, A501130), and the rest 
were prepared from RNA purified from normal subjects tissue samples. Normal pool 
sample was prepared form comercial RNA pool of normal colon (BioChain, Cat. No. 
CDP.061003, Lot Nos: A41 1078). 

5 RTs Col_2C- Col_23C were prepared from colon adenocarcinoma RNAs, 

matched to the normal samples Col_2N- Col_23N. Samples Col_22C and Col_23C 
were prepared from commercial RNAs BioChain, Cat. No. CDP-064007, Lot Nos: 
A501131, A501129), and the rest were prepared from RNA purified from subjects 
colon adenocarcinoma cancer tissue samples. 

10 Colon cell lines - Col-SW620 is epithelial colorectal adenocarcinoma from 

metastatic lymph node. Dukes C (ATCC Catalog No: CCL-227), Col-SW480 is 
epithelial colorectal adenocarcinoma. Dukes B (ATCC Catalog No: CCL-228) and 
Col-DLDl is epithelial colorectal adenocarcinoma. Dukes C (ATCC Catalog No: 
CCL-221). 

15 Ovary — Ovarian RNA was generated as described in Table 2, below. 



Table 2 



Serial number 


Lot number 


Source 


Tissue 


Pathology (grade) 


I'Pap Adeno G3 


ILS-1406 


ABS 


ovary 


papillary adenocarcinoma (3) 


2' Pap Adeno G2 


ILS-1408 


ABS 


ovary 


papillary adenocarcinoma (2) 


3 -Pap Adeno G2 


ILS-1431 


ABS 


ovary 


papillary adenocarcinoma (2) 


4'Pap CystAde G2 


ILS-7286 


ABS 


ovary 


papillary cystadenocarcinoma (2) 


5-Adeno G3 


99-12-G432 


GOG 


ovary 


adenocarcinoma (3) 


6'Adeno G3 


AO 106 


ABS 


ovary 


adenocarcinoma (3) 


7'Adeno G3 


IND-00375 


ABS 


ovary 


adenocarcinoma (3) 


8'Adeno G3 


A501113 


BioChain 


ovary 


adenocarcinoma (3) 


9'AdenoG3 


99-06-G901 


GOG 


ovary^ 


adenocarcinoma (maybe serous) 

(?) 


lO'Adeno G3 


A407069 


Biochain 


ovary 


adenocarcinoma (3) 


ll'Adeno G3 


A407068 


Biochain 


ovary 


adenocarcinoma (3) 


12-Adeno G3 


A406023 


Biochain 


ovary 


adenocarcinoma (3) 


l3'Adeno G3 


94-05-7603 


GOG 


right ovary 


metastasis adenocarcinoma (3) 


14-Adeno G2 


A501111 


BioChain 


ovary 


adenocarcinoma (2) 


1 5-Carcinoma G3 


A407065 


BioChain 


ovary 


carcinoma (3) 


Id-Carcinoma 


109038' 


Clontech 


ovary 


carcinoma NOS 


17'Muc Adeno G3 


A504084 


BioChain 


ovary 


mucinous adenocarcinoma (3) 


18-Muc Adeno G3 


A504083 


BioChain 


ovary 


mucinous adenocarcinoma (3) 


19-Muc Adeno G3 


A504085 


BioChain 


ovary 


mucinous adenocarcinoma 


20'Pap Afuc CysiAde 


USA-00273 


ABS 


ovary 


papillary mucinous 
cystadenocarcinoma 


21-Afuc CystAde G2'3 


95-10-G02C 


GOG 


ovary 


mucinous cystadenocarcinoma (2- 
3) 
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22'Muc CystAde G2 i 


\0139 1 


\BS ( 


Dvary i 


Tiucinous cystadenocarcinoma (2) 


23-Muc CystAde G3 ( 


}0187 i 


ABS ( 


1 

Dvary ^ 


Tiucinous cystadenocarcinoma 
ivith low malignant (3) 


24-Pap Sera Adeno G3 < 


ZOOl-07- 
3801 ( 


GOG { 


avary 


papillary serous adenocarcinoma 

[3) 


25'Pap Sera Adeno G3C 


M0021 


ABS ( 


1 

avary 


papillary serous adenocarcinoma 
[3C) 


26-Sero Adeno G3 


2001-12- 
G035 


GOG 


right ovary 


serous adenocarcinoma (3) 


27'Pap Sero Card G3 


2001-08- 
GOl 1 


GOG 


ovary 


papillary serous carcinoma (3) 


2*- Pap Sero CystAde G3 


A503176 


BioChain 


ovary 


serous papillary 
cystadenocarcinoma (3) 


29' Pap Sero CystAde G3 


93-09-4901 


GOG 


ovary 


serous papillary 
cystadenocarcinoma (3) 


30' Pap Sero CystAde G3 


A503175 


BioChain 


ovary 


serous papillary 
cystadenocarcinoma ( 1 ) 


31'Pap Endo Adeno G3C 


95-04-2002 


GOG 


ovary 


papillary endometrioid 
adenocarcinoma (3C) 


32-Endo Adeno G2 


94-08-7604 


GOG 


right ovary 


endometrioid adenocarcinoma (2) 


33-Endo Adeno Gl-2 


2000-09- 
G621 


GOG 


ovary 


endometrial adenocarcinoma (1-2) 


34-Mix Sero/Endo G3 


2002-05- 
G513 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma (3) 


35'Mix Sero/Endo G3 


2002-05- 
G509 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma of mullerian (3) 


36''Mix Sero/Endo G3 


2001-12- 
G037 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma (3) 


3 7'Mix Sero/Endo G2 


95-11-G006 


GOG 


ovary,endometi 
ium 


papillary serous and endometrioid 
cystadenocarcinoma (2) 


38'Mix Sero/Muc/Endo G2 


98-03-G803 


GOG 


ovary 


mixed epithelial 
cystadenocarcinoma with 

mucinous, endometrioid, 
squamous and papillary serous(2) 




98-08-GOOl 


GOG 


ovary 


epithelial adenocarcinoma of* 
borderline malignancy 


40'Clear cell Adeno G3 


2001-10- 

G002 


GOG 


ovary 


clear cell adenocarcinoma (3) 


41-Clear cell Adeno 


2001-07- 
G084 


GOG 


ovary 


clear cell adenocarcinoma 


42"^ 


A503274 


BioChain 


ovary 


Normal 


43'N 


A504086 


BioChain 


ovary 


Normal 


44'N 


061P43A 


Ambipn 


ovary 


Noiimal 


45-N 


A504087 


BioChain 


ovary 


Normal 


46'NM14 


A501112 


BioChain 


ovary 


Normal (matched tumor A501 111] 


47'NM8 


A50H14 


BioChain 


ovary 


Normal (matched tumor A501 113] 


48'NM38 


98-03- 
G803N 


GOG 


ovary 


Normal (matched tumor 98-03- 
G803) 


49'NM39 


98-08- 
GOOIN 


GOG 


ovary 


Normal (matched tumor 98-08- 
GOOl) . 



RT reaction with oligo-dT - Reverse transcription was effected using 2 jig of 
total RNA, in a 20 \x\ reaction, including 200 units of Superscript 11 Reverse 
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Transcriptase (Bibco/BRL) in the buffer supplied by the manufacturer, 500 pmol of 
oligo(dT)25 (Promega Corp. Madison WI, USA), and 40 units of RNasin (Promega 
Corp. Madison WI, USA). 

Real Time PCR - 5ul RT reaction products, diluted in final reaction volume of 
20ul was used for amplification. Specific oligonucleotides (SEQ ID NO:4 and SEQ 
ID NO:5) were used as primers. The ABI Prism 7000 Sequence Detection System was 
used for cycling. The reaction was effected using SYBR GreenPCR Master Mix 
(Applied Biosystems). The cycle in which the reactions achieved a threshold level 
(Ct) of fluorescence was registered and served to calculate the initial transcript 
quantity in the RT reaction. Control PCR reactions were effected on the same RT 
sample, using PCR primers specific to the house keeping gene ribosomal protein S27a 
(RPS27A). An amplicon fragment thereof (SEQ ID NO: 23) was generated using the 
primers set forth in SEQ ID NOs: 21 and 22. For each primer set, the value of each 
PCR reaction was divided into the value of one of the RTs (a pool of normal lung 
transcripts from BioChain). In order to normalize the results, the ratio for each PCR 
reaction was then divided by the house keeping gene ratio for the same RT. 

RT PCR for Examples 6-9 - 1 |ig of treated RNA was mixed with 150 ng 
Random Hexamer primers (Invitrogen) and 500 jaM dNTP in total volume of 15.6 ^il. 
The mixture was incubated for 5min at 65*^C and then quickly chilled on ice. Then, 5|xl 
5 X Superscriptll first strand buffer (Invitrogen), 2.4 jxl 0.1 M DTT and 40 units 
Rnasin (Promega) were added, and the mixture was incubated for 10 min at 25 ^'C, 
followed by further incubation at 42 °C for 2 min. Then, 1 ^il (200 units) of 
Superscriptll (Invitrogen) was added and the reaction (final volume of 25 jal) was 
incubated for 50 min at 42 °C and then inactivated at 70 ""C for 15 min. The resulting 
cDNA was diluted 1 :20 in 10 niM TO.IE. 

Real-Time RT-PCR used for analyzing expression pattern of SEQ ID NOs. 
1, 9, 18 and 19 in different ovary and lung samples (Examples 7-9, Figures 8-1 1)- 
5\i\ of diluted cDNA prepared with random primers were used as a template in Real- 
Time PCR reactions using the SYBR Green I assay (PE Applied Biosystem) with 
specific primers. The amplification stage Was effected as follows, 50 for 2 min, 95 
X for 10 min, 95 X for 15 sec, followed by 60 ''C for 1 min. Detection was effected 
using PE Applied Biosystem SDS 7000. The. cycle in which the reactions achieved a 
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threshold level (Ct) of fluorescence was registered and served to calculate the initial 
transcript quantity in the RT reaction. The quantity was calculated using a standard 
curve created using serial dilutions of either a purified amplicon product or reverse 
transcription (RT) reaction prepared from RNA mix purified from 5 cell-lines 

5 (HCTl 16, H1299, DU145, MCF7, ES-2). To minimize inherent differences in the RT 
reaction, the resulting quantity was normalized to the geometric mean of the 
quantities of several housekeeping genes (different HSKP genes were used for the 
different tissue panels). 

RT'PCR analysis used for analyzing expression pattern of SEQ ID NOs, 18 

10 and 19 in different breast samples (examples 10,11, Figures 12,13 ) — 5 jil of diluted 
cDNA prepared with random primers were used as a template in RT-PCR reactions 
using the SYBR Green I assay (PE Applied Biosystem) with specific primers. The 
amplification stage was effected as follows, 50 ''C for 2min, 95 for 10 min, 95 ^'C 
for 15 sec, followed by 60 ®C for 1 min. PCR products were analyzed by 1.8 % 

15 agarose gels. 

EXAMPLE 3 

Expression pattern of a SIM2 derived fragment (SEQ ID NO:20) in normal and 

malignant lung samples 

20 The expression level of a S1M2 derived fragment corresponding to SEQ ID 

NO:20, which is a fragment of SEQ ID NO: 1 (nucleotide coordinates 125-225, see 
Figure 1) was determined using the primers set forth in SEQ ID NOs: 4 and 5 
(designated as primers 1 and 2 in Figure 1) and measured by real time PCR. The 
expression of the housekeeping gene RPS27A (GenBank Accession No. NM_002954, 

25 SEQ ID NO: 23) was determined similarly using primers: SEQ ID NOs: 21, 22. 
Expression values were first normalized to a housekeeping gene. The expression was 
then calculated relative to a pool of normal lung samples (Lu_6N). 

As shown in Figure 2, representing two duplicates of the same experiment, the 
expression of the SIM2 fragment (SEQ ID NO: 1) in the normal samples was 

30 significantly lower than in the tumor samples. Interestingly, high expression was 
found in adenocarcinoma samples (4 out of the 9 samples) and squamous cell 
carcinoma samples (7 out of the .14 samples). The expression in these samples was 
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between 10 and 200 fold higher as compared to the expression pattern in normal 
samples. 

EXAMPLE 4 

Expression pattern of a SIM2 derived fragment (SEQ ID NO:20) in normal and 

malignant colon samples 
The expression levels of a S1M2 derived fragment corresponding to SEQ ED 
NO. 20 (a portion of SEQ ID NO: 1 ampUfied by the primers set forth in SEQ ID NOs. 
4-5) as well as the housekeeping gene RPS27A (GenBank Accession No. 
NM_002954, SEQ ID NO: 23) using primers: SEQ ID NOs: 21, 22) were measured by 
real time PGR. Each value was first normalized to the housekeeping gene. The 
expression level was then calculated relative to a pool of RNA from normal colon 
(Col-normal pool). 

As shown in Figure 3, representing two duplicates of the same experiment, the 
expression in most of the adenocarcinoma and cell lines samples was higher than in 
the normal samples, 

EXAMPLE 5 

Expression pattern of a SIM2 derived fragment (SEQ ID NO: 9) in normal and 

malignant lung samples 

Expressions of long (SEQ ID NO: 7) and short (SEQ ID NO: 8) SIM variants 
was measured by real time PGR using a sequence fragment (SEQ ID NO: 9, and the 
primers set forth in SEQ ID NOs: 10 and 1 1, designated as primers 9 and 10 in Figure 
1, respetively) which is shared by both sequences. Each expression was first 
normalized to RPS27A (SEQ ED NO: 23; using primers: SEQ ID NOs: 21, 22) 
(Figures 4 through 7). Then the expression level was calculated relative to a pool of 
normal lungs (Lu_6N). 

As shown in Figures 4 through 7, SIM2 expression in normal samples was very 
low. High expression was found in adenocarcinoma samples and squamous cell 
carcinoma samples. Note that over-expression in tumor samples was 10 to 2000 fold 
relatively to the expression of SIM2 in normal samples. Figures 5 and 7 show a better 
resolution pattern of SIM2 expression on a scale of 0-200.. 



56 

EXAMPLE 6 

Expression of SIM2 derived fragments in normal and cancerous ovary tissues 
The expression of two different SIM2-derived sequences (SEQ ID NOs:20 and 
9) and three housekeeping genes - PBGD (GenBank Accession No: HSPBGDR, SEQ 

5 ID NO: 32; using the primers set forth in SEQ ID NOs: 30, 31), ATP-6-syn (GenBank 
Accession No: NM_I733702, SEQ ID NO: 26; using the primers set forth in SEQ ID 
NOs: 24, 25), 18s ribosomal RNA (GenBank Accession No: HSRRN18S, SEQ ID 
NO: 29; using the primers set forth in SEQ ID NOs: 27, 28) were measured by real 
time PGR. In each RT sample, the expression of SIM2 sequences was. normalized to 

10 the geometric mean of the quantities of three housekeeping genes PBGD, ATP-6-syn, 
18s ribosomal RNA, as detailed in Example 2, hereinabove. The normalized quantity 
of each RT sample was then divided by the normalized quantity of a normal sample 
(No. 45, Table 2 above). 

As shown in Figure 8, SIM2 expression in normal samples (samples nos. 42- 

15 49, Table 2 above) was significantly lower than in the cancer samples. Notably, the 
highest expression of SIM2 was found in papilary serous (carcinoma or 
adenocarcinoma or cystadenocarcinoma) samples (samples 24, 27, 29, Table2). 

EXAMPLE 7 

20 Expression of SIM2 long variant - derived fragment (SEQ ID NO: 18) in normal and 

cancerous ovary tissues 
Expression of SIM2 long variant (GenBank Accession No: gi71 08363, SEQ ID 
NO: 7) was measured by real time PGR using a fragment (SEQ ID NO: 18 
corresponding to nucleotide coordinates 1551-1670 of SEQ ID NO: 7 using the 

25 primers set forth in SEQ ID NOs: 14- 15, designated as primers 12 and 13 in Figure 1, 
respectively). In addition the expression of two housekeeping genes - PBGD 
(GeriBank Accession No: HSPBGDR, SEQ ID NO: 32; using the primers set forth in 
SEQ ID NOs: 30-31) and HPRTl (GenBank Accession No: GI_32449, SEQ ID NO: 
35; using the primers set forth SEQ ID NOs: 33-34), was measured by real time PGR. 

30 In each RT sample, the expression of SIM2 sequerices was normalized to the 
geometric mean of the quantities of the housekeeping genes as detailed in Example 2, 
hereinabove. The normalized quantity of each RT sample \yas then divided by the 
averaged quantity of the normal samples (No. 42-48, Table 2). 
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As shown in Figure 9, SIM2 expression in normal samples (sample Nos. 42- 
48, Table 2) was significantly lower than in the cancer samples. Notably, the highest 
expression of S1M2 was found in 4 out of 6 papillary serous samples (carcinoma or 
adenocarcinoma or cystadenocarcinoma). 

5 

EXAMPLE 8 

Expression of SIM2long variant - derived fragment (SEQ ID NO:18) in normal and 

cancerous lung tissues 
Expression of SIM2 long variant (SEQ ED NO: 7) was measured in normal and 

10 cancerous lung tissues (see Table 3, below) by real time PGR using a sequence 
fragment (SEQ ID NO: 18 corresponding to nucleotide coordinates 1551-1670 of SEQ 
ID NO: 7 using the primers set forth in SEQ ID NOs: 14-15, designated as primers 12 
and 13 in Figure 1, respectively). In addition the expression of three housekeeping 
genes - SDHA (GenBank Accession No: NM_004168, SEQ ID NO: 38 was measured 

15 using the primers set forth in SEQ ID NOs: 36 and 37), RPS27A (GenBank Accession 
No: NM_002954, SEQ ID NO: 23 was measured using the primers set forth in SEQ 
ID NOs: 21, 22), PBGD (GenBank Accession No: HSPBGDR, SEQ ED NO: 32; using 
the primers set forth in SEQ ID NOs: 30, 31), was measured by real time PCR. In 
each RT sample, the expression of SIM2 sequences was normalized to the geometric 

20 mean of the quantities of the housekeeping genes as detailed in Example 2, 
hereinabove. The normalized quantity of each RT sample was then divided by the 
averaged quantity of the normal samples (No. 46-54, Table 3). 



Tables 



Serial number 


Lot number 


Pathology 


Source 


1 


A504117 


Adenocarcinoma 


Biochain 


2 


A504118 


Adenocarcinoma 


Biochain 


3 


CG-200 


Adenocarcinoma 


Ichilov 


4 


Coin-02-43T-M-2237T 


Adenocarcinoma Grade 1 


Clinomics 


5 


Com-02-49T-M-22 14T 


Adenocarcinoma Grade 2 


Clinomics 


6 


Com-02-55T-M-2206T 


Adenocarcinoma Grade 3 


Clinomics 


7 


Com-02-57T-M-2285T 


Adenocarcinoma Grade 4 . 


Clinomics 


8 


Com-02-59T-M-2261T 


Adenocarcinoma Grade 2 


Clinomics 


9 


Com-02-4 1 T-M-2269T 


Adenocarcinorna Grade 3 ' 


Clinomics 


11 


Com-02-53T-M-2221T 


Adenocarcinoma Grade 1 . 


Clinomics 


12 


A504119 


Moderately adenocarcinorna . 


Biochain 


13 


A504116 


moderately to poorly adenocarcinoma 


Biochain 


14 


CG-lll 


Adenocarcinoma 


Ichilov 
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15 


CG-244 


Bronchioloalveolar adenocarcinoma 


[chilov 


16 


M09091 


Moderately squamous 


Biochain 


17 


A503183 


moderately squamous 


Biochain 


18 


A503387 


moderately to poorly squamous 


Biochain 


19 


A408175 


Squamous 


Biochain 


20 


A501121 


Squamous 


Biochain 


21 


A503187 


Squamous 


Biochain 


22 


A503386 


Squamous 


Biochain 


23 


CG-109(1) 


Squamous 


Ichilov 


24 


CG-123 


Squamous 


Ichilov 


25 


CG-204 


Squamous 


Ichilov 


26 


Com-02-47T-M-2208T 


Squamous Grade 3 


Clinomics 


27 


Com.02-61T.M-2215T 


Squamous Grade 2 


Clinomics 


28 


Com-02-63T>M-2216T 


Squamous Grade 3 


Clinomics 


29 


Com-02-65T-M-2239T 


Squamous Grade 1 


Clinomics 


30 


A501389 


Small cell 


Biochain 


31 


A501390 


Small cell 


Biochain 


32 


A501391 


Small cell 


Biochain 


33 


A504115 


Small cell 


Biochain 


34 


Com-02-45T-M-22 1 7T 


Small cell Grade 2 


Clinomics 


35 


Com-02-69T-M.2210T 


Small cell Grade 3 


Clinomics 


36 


Com-02-71T-M-22I8T 


Small cell Grade 2 


Clinomics 


37 


Com-02-73T-M-2235T 


Small cell Grade 1 


Clinomics 


38 


A504113 


large cell 


Biochain 


39 


A504114 


large cell 


Biochain 


40 


Com-02-75T-M-22 1 2T 


Large cell Grade 3 


Clinomics 


41 


Com-02.77T-M-2257T 


Large cell Grade 4 


Clinomics 


42 


Com-02-79T-M-2241T 


Large cell Grade 2 


Clinomics 


43 


Com-02- 1 63T-M-2290T 


Large cell Grade 1 


Clinomics 


44 


A501123 


Moderately alveolus carcinoma 


Biochain 


45 


A501221 


Alveolus carcinoma 


Biochain 


46 


A501 124 


Normal 


Biochain 


47 


A503205 


Normal 


Biochain 


48 


A503206 


Normal 


Biochain 


49 


A503384 


Normal 


Biochain 


51 


Com-02-44N-M-2237N 


normal M4 


Clinomics 


53 


Com-02-42N-M-2269N 


normal M9 


Clinomics 


54 


Com-02-48N-M-2208N 


normal M26 


Clinomics 



Table 3 ConL 



As shown in Figure 10, SIM2 expression in normal samples (sample Nos, 46- 
54, Table 3) was significantly lower than in the cancer isamples. hiterestingly, high 
expression was found in adenocarcinoma samples (7 out of the 15 samples) and 
squamous cell carcinoma samples (9 out of the 14 sarnples). 
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EXAMPLE 9 

Expression of SIM2short variant- derived fragment (SEQ ID NO:19) in normal and 

cancerous ovary tissues 

Expression of SEVI2 short variant (GenBank Accession No: gi7108361, SEQ 
ID NO: 8) was measured by real time PGR using the sequence fragment set forth in 
SEQ ID NO: 19 (nucleotide coordinates 1224-2324 of SEQ ID NO: 7, amplified using 
the primers set forth in SEQ ID NOs: 16-17 designated as primers 14 and 15 of Figure 
1, respectively). In addition the expression of two housekeeping genes — PBGD was 
measured by real time PGR as described above. In each RT sample, the expression of 
SIM2 sequences was normalized to the geometric mean of the quantities of the 
housekeeping genes as detailed in Example 2, hereinabove. The normalized quantity 
of each RT sample was then divided by the averaged quantity of the normal samples 
(No. 42-49, Table 2 above). 

As shown in Figure 11, SIM2 expression in normal samples (sample Nos. 42- 
49, Table 2 above) was significantly lower than in the cancer samples. 

EXAMPLE 10 

Expression of SIM2 long variant - derived fragment (SEQ ID NO:18) in normal 

and cancerous breast tissues 
Expression of SIM2 short variant (SEQ K) NO: 7) was evaluated by RT-PGR 
using a sequence fragment (SEQ ID NO: 18, described above). As shown in Figure 
12, SIM2 expression in most tumor samples was higher than in the normal samples 
(Table 4, below) . 

EXAMPLE 11 

Expression of SIM2 short variant - derived fragment (SEQ ID NO: 19) in normal 

and cancerous breast tissues 
Expression of SIM2 short variant (SEQ ID NO: 8) was evaluated by RT-PGR 
of a sequence fragment (SEQ ID NO: 19, primers 14 and 15, see Figure 1) normal and 
cancerous breast samples (see Table 4, below) 
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Table 4 



Serial number 


Cat No. 


Pathology 


Source 


1 


M-0140T 


DCIS Grade 1 


clinomics 


2 


M-OllOT 


DCIS Grade 2 


clinomics 


3 


M-0150T 


IDC Grade 1 


clinomics 


4 


M-2159T 


IDC Grade 1 


clinomics 


5 


M-2168T 


IDC Grade 1 


clinomics 


6 


7238T 


IDC-Gl 


ABS 


7 


7263T 


IDC - G2 


ABS 


8 


M-0113T 


IDC Grade 2 


clinomics 


9 


M-0160T 


IDC Grade 2 


clinomics 


10 


M-2160T 


IDC Grade 2 


clinomics 


11 


M-2175T 


roc Grade 2 


clinomics 


12 


1432T 


IDC - G2 


ABS 


13 


A0133T 


IDC - G2 


ABS 


14 


A0135T 


IDC - G2 


ABS 


15 


7259T 


IDC - G2 


ABS 


16 


20032T 


roc-G2 


ABS 


17 


20036T 


IDC - G2(3) 


ABS 


18 


M-2169T 


IDC Grade 2 


clinomics 


19 


M-2162T 


roc Grade 2 


clinomics 


20 


M-OlllT 


roc Grade 3 


clinomics 


21 


M-0il2T 


IDC Grade 3 


clinomics 


22 


M-0114T 


IDC Grade 3 


clinomics 


23 


M-0115T 


roc Grade 3 


clinomics 


24 


M-0180T 


IDC Grade 3 


clinomics 


25 


M-2176T 


IDC Grade 3 


clinomics 


26 


7249T 


IDC - G3 


ABS 


27 


20072T 


roc-G3 


ABS 


28 


M-2161T 


IDC Grade 3 


clinomics 


29 


M-2170T 


roc Grade 3 


clinomics 


30 


M-2177T 


roc Grade 3 


clinomics 


31 


CG-154 


IDC 


Ichilov 


32 


7116T 


Mucinous carcinonia 


ABS 


33 


M-0140N 


normal matched to IT 


clinomics 


34 


M-OllON 


normal matched to 2T 


clinomics 


35 


7238N 


normal matched to 6T 


ABS 


36 


7263N 


normal matched to 7T 


ABS 


37 


M-0150N 


normal matched to 3T 


clinomics 


38 


7116N 


normal matched to 32T 


ABS 


39 


7259N 


normal mathed to 15T 


ABS 


40 


1432N 


normal mathed to 12T 


ABS 


41 


7249N 


normal mathed to 26T 


ABS 


42 


2003 IT 


IDC Grade 3 


ABS 



roC= invasive Ductal Carcinoma 



DCIS= Ductal Carcinoma In Situ 
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As shown in Figure 13, SIM2 expression in normal samples (sample Nos. 33- 
41, Table 4 above) was significantly lower than in the cancer samples. 

It is appreciated that certain features of the invention, which are, for clarity, 
5 described in the context of separate embodiments, may also be provided in 
combination in a single embodiment. Conversely, various features of the invention, 
which are, for brevity, described in the context of a single embodiment, may also be 
provided separately or in any suitable subcombination. 



10 Although the invention has been described in conjunction with specific 

embodiments thereof, it is evident that mciny alternatives, modifications and variations 
will be apparent to those skilled in the art. Accordingly, it is intended to embrace all 
such altematives, modifications and variations that fall within the spirit and broad 
scope of the appended claims. All publications, patents and patent applications 

15 mentioned in this specification are herein incorporated in their entirety by reference 
into the specification, to the same extent as if each individual publication, patent or 
patent application was specifically and individually indicated to be incorporated herein 
by reference. In addition, citation or identification of any reference in this application 
shall not be construed as an admission that such reference is available as prior art to 

20 the present invention. 



